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On the Addressing Problem 
for Loop Switching 


By R. L. GRAHAM and H. 0. POLLAK 
(Manuscript received March 25, 1971) 


The methods used to perform the switching functions of the Bell System 
have been developed under the fundamental assumption that the holding 
time of the completed call is long compared to the time needed to sel up 
the call. In considering certain forms of communication with and among 
computers the possibility arises that a message, with its destination at 
its head might thread its way through a communication network without 
awaiting the physical realization of a complete dedicated path before 
beginning on tts journey. One such scheme has been proposed by J. R. Pierce 
and may be called “loop switching.”’ We imagine subscribers, perhaps 
best thought of as computer terminals or other data generating devices, 
on one-way loops. These “local” loops are connected by various switching 
points to one another as well as to other “regional” loops which are in 
turn connected to one another as well as to a ‘‘national” loop. If a message 
from one loop is destined for a subscriber on another loop it proceeds 
around the originating loop to a suitable switching point where it may 
choose to enter a different loop, this process continuing until the message 
reaches its destination. The question naturally comes up, how the message 
is to know which sequence of loops to follow. It would be desirable for the 
equipment at each junction to be able to apply a simple test to the destination 
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address at the head of the message which would determine which choice 
the message should make at that junction. 

In this paper we propose a method of addressing the loops which has 
several attractive features: 


(1) It permits an extremely simple routing strategy to be used by the 
messages in reaching their destinations. 
(12) By using this strategy, a message will always take the shortest 
possible path between any two local loops in the same region. 
(zit) The method of addressing applies to any collection of loops, no 
matter how complex their interconnections. 


The addressing scheme we propose will be applied primarily to local 
loops where the mutual interconnections may be quite varied. If a certain 
amount of hierarchical structure is introduced into the regional and national 
loop structure, as suggested by J. R. Pierce,’ it is possible to achieve 
addressings which are both compact and quite efficient. 


I, INTRODUCTION 


The methods used to perform the switching functions of the Bell 
System have been developed under the fundamental assumption that the 
holding time of the completed call is long compared to the time to set 
up the call. It is thus sensible to hold portions of a route while the 
attempt is made to establish the connection. In considering certain - 
forms of communication with and among computers, as well as the 
consideration of many schemes for time division switching, the 
possibility arises that a message, with its destination at its head, might 
thread its way through a communication network without awaiting the 
physical realization of a complete dedicated path before beginning on its 
journey. 

One such scheme has been proposed by J. R. Pierce,’ and may be 
called ‘‘loop switching.’’ We imagine subscribers, perhaps best thought 
of as computer terminals or other data generating devices, on one-way 
loops. If a meassage is destined for a subscriber on another loop it 
proceeds around the originating loop to a suitable switching point where 
it may. choose to enter a different loop and continue the process until 
it reaches its destination. 

The question now comes up, how the message is to know which se- 
quence of loops to follow. A sufficiently complicated memory in the 
originating loop might, of course, look up an appropriate route, and then 
attempt to seize a complete path; but this is the old and perhaps in- 
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appropriate solution. It would be more convenient and sometimes 
preferable if the equipment at each junction could apply some simple 
test to the destination address at the head of the message which would 
determine which choice the message should make at that junction. 

In the nation-wide loop switching system as conceived by Pierce, we 
can envisage local loops, regional loops, and a national loop. The simplest 
imaginable structure is one in which each local loop has an interchange 
only with its regional loop and with no other loop; similarly each regional 
loop interchanges with the national loop and otherwise only with its 
local loops. How does it work? Suppose that a message originates in local 
loop X, and has its destination in local loop Y, where X and Y may or 
may not be identical. When the message comes to the interchange 
between X and X’s regional loop, it exits onto the regional loop if and 
only if Y # X. It later exits onto the national loop if Y’s region is 
different from X’s region; otherwise the message stays on X’s regional 
loop until it reaches Y. Therefore what should addresses look like? We 
see that if a portion of the loop address represents the regional loop, and 
another portion the local loop, routing decisions will be made on the 
basis of identity or nonidentity of certain portions of the sending and 
the receiving addresses. 

The loop configuration just described is perhaps too special to be 
practical. For example, it provides for no alternate routing, and for no 
special direct connections between two local loops with high mutual 
traffic. Pierce has shown how each of these difficulties can, to some 
extent, be alleviated. There remain, however, the further problems of 
the configuration of local loops belonging to a given region, and of the 
configuration of regional loops themselves. It is quite likely that the 
local loops attached to a given regional loop have many mutual switching 
points among themselves, so that calls within one region are not normally 
expected to use the regional loop. How should we address such local 
loops so as to make routing easy? Much of the rest of this paper will 
be devoted to this problem. We shall, in this and the next two sections, 
speak of “loops” generally, but mean a system of local loops as the most 
likely realization. We note in passing that a completely general national 
configuration of loops on which no hierarchical structure has been 
imposed will have the same addressing problem—but probably a much 
larger number of loops. We return to the hierarchical situation in Section 
IV. 

In some very simple arrangements of loops it is easy to see how 
addressing might successfully be accomplished. Consider, for example, 
Fig. 1 showing four loops which touch as if they were circles of radius § at 
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Fig. 1—Simple arrangement of loops. 


the vertices of the unit square. If the address of each loop were just the 
two-digit numeral 77, 7, 7 = 0 or 1, representing the coordinates of its 
center, then routing could be done in the following extremely simple 
manner: at each junction, go into the new loop if this decreases the 
Hamming distance* between where you are and your destination. If it 
doesn’t decrease the Hamming distance, don’t go. Thus, if you wish to go 
from loop 10 to loop 11 then the Hamming distance is 1. You will not 
take the exit from 10 to 00 if you reach it first, for this increases rather 
than decreases the Hamming distance. You will, however, exit into 11 
when you reach that junction. To go from 10 to 01 either exit, to 00 or to 
11, improves the Hamming distance and either routine is equally good. 

A simple potential routing scheme can thus be described as follows. 
Each loop has a binary address, n bits long. You make an exit from one 
loop to another if and only if it decreases the Hamming distance between 
where you are and where you want to go. If several exits do the same job 
then each one must lead to an equally short optimal path from sending 
loop to receiving loop. Furthermore, the number of loops traversed 
should, if possible, be exactly the Hamming distance between sender and 
receiver, with each transfer decreasing the distance from the receiver by 
exactly 1. 

Can such an addressing scheme be devised for every collection of loops 
with whatever adjacency structure? A little reflection shows that there 
will certainly be difficulties. Let’s think of the collection of loops ab- 
stractly as a graph, with each loop a vertex, and two vertices connected 
if and only if the two loops have a mutual transfer point. Thus, the 
graph of the previous example is as shown on Fig. 2. We have numbered 
each vertex with a pair of binary digits so that adjacent vertices differ 
in exactly one position, the number of edges required to pass from one 
point to another is exactly the Hamming distance between the cor- 
responding numberings, and all shortest paths between two points are 
achieved by following routes of decreasing Hamming distance to the 
destination. Another example (Fig. 3): if we wanted a collection of six 


* The Hamming distance between two n-place binary numbers is the number of 
places in which they differ. 
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oo 10 


Fig. 2—Graph of Fig. 1. 


loops arranged cyclically we could use the numbering 000, 100, 101, 111, 
011, and 010. We see that we are looking for a closed path on the 3- 
dimensional cube with the additional property that two points are 
exactly as far apart in Hamming distance as the number of edges to be 
traversed between them — otherwise, the routing logic would be ruined. 
Thus we can use the realization for a cycle of six loops shown in Fig. 4a. 
The realization shown in Fig. 4b, however, would not be a valid solution. 
In this latter picture, 100 and 110 have Hamming distance 1 and there- 
fore should be directly connected. The path between them, however, has 
length 3, the first link out increases rather than decreases Hamming 
distance, and therefore would not represent a useful addressing scheme. 

We thus see a difficulty caused by points coming too close together on 
the cube for the addressing scheme to work, but there are even deeper 
difficulties. Suppose we wish to construct an addressing scheme for a 
system consisting of three pairwise adjacent loops (see Fig. 5). This can 
never be drawn on a cube of any dimension. For any closed path of 
edges on a cube has even length, and 3 is odd. Is the scheme therefore 
kaput? 

Not quite. We can still imagine the 3-cycle embedded on a cube in an 
appropriate dimension (in this case a square) if we are willing to gen- 
eralize what we mean. We shall attach to A the code 00, B the code 10, 
and to C both 11 and O01. We shall denote the pair 11 and 01 by the 
symbol dl, where d means “‘don’t care.’”’ Hamming distance between two 
n-tuples of 0’s ,1’s, or d’s is computed by crediting 1 for every position at 


Fig. 3—Cyclic arrangement of six loops. 
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Fig. 4—Realizations for a cycle of six loops: (a) a valid solution; (b) not a valid 
solution. 


which one n-tuple has a 0 and the other a 1, and O for every other 
position. Thus, the Hamming distance between 01d1d0 and 11d010 is 2, 
with the contributions coming from the first and fourth positions. With 
this convention, the Hamming distance between any two of the three 
addresses 00, 10, and d1 is certainly 1, and correct routing still consists 
exactly of decreasing by 1 the Hamming distance at each junction at 
which a transfer is made. 

We now have a number of fundamental questions to answer. Can 
every collection of loops be numbered by assigning to each loop an address 
consisting of a sequence of 0’s, 1’s, and d’s? We require that every 
shortest route between two loops can be found automatically by moving 
from a loop to an adjacent one if and only if this decreases the Hamming 
distance to the final destination by 1. How many bits long would such 
an address have to be? Let’s state right away the fundamental theorem 
of this paper: Every collection of n loops, with maximum distance s 
between any two loops, can indeed be realized by giving each loop an 
address of no more than s(n — 1) 0’s, 1’s, or d’s. In fact, we know of no 
example where more than (n — 1) “bits” are needed, and we shall give - 
a construction that has found addresses no more than n — 1 bits long 
in every case on which it has been tried. The construction, however, is 
not quite an algorithm and we do not have a proof that it can always be 
done with as few as n — 1 bits. 


WITH ABSTRACT GRAPH 


A B 


Fig. 5—Three pairwise adjacent loops. 
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How is this routing algorithm going to work in practice? Here we have 
only the very earliest and simplest suggestions. The basic idea of the 
scheme is to obtain the greatest possible simplicity of routing strategy at 
the expense of the length of the loop address. Thus, for example, you 
could physically realize addresses consisting of 0’s, l’s, and d’s by 
encoding 0 as 00, 1 as 01, and d as either 10 or 11. The logic then says: 
If the 2k — 1st digit of both addresses is 0 then compute the Hamming 
distance between the 2kth digits. If the 2k — 1st digit of either address 
is 1, ignore it. Add up over all k, and see if going into the new loop 
decreases Hamming distance to the destination. This could be very easy 
to mechanize; the arbitrary bit following a 1 in an odd position could be 
used for parity checks or other purposes. 

It is not immediately clear who assigns the loop address to an in- 
dividual message. The “phone book” may contain a shorter code that is 
translated in the first junction you come to, or the sending computer 
itself may use the destination’s correct loop address. This problem is 
connected with that of system growth. How many numbers do you have 
to change if a loop is added to the system? The consequent desire for a 
hierarchical loop address structure is to a large extent fulfillable and will 
be discussed in Section IV. 

Before we proceed with the general theory, let’s see how a particular 
and not so simple example works out. Thus, consider the system of 
loops in Fig. 6. The distance between pairs of vertices is given by the 
following (symmetric) table: 


A B C D E F 


A|0 2 1 3 1 2 
B | 2 0 2 1 1 2 
Cj\1 2 0 2 1 1 
D|3 1 2 0 2 1 
EH} il 1 1 2 0 2 
F | 2 2 1 1 2 0 


We shall assign a sequence of five 0’s, 1’s, and d’s to each vertex in such a 
way that the Hamming distance between the 5-tuples corresponding to 
two vertices is exactly the distance in the table. One solution, as the 
reader should verify, is the following: 
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WITH ABSTRACT GRAPH 





Fig. 6—System of loops. 


A—111l1d 
B—001dd 
C—11d0d 
D—000d1 
E—10dd0 
F—010dd 


In the sequel, we shall see how such a solution can in fact be found for 
every possible system of loops. A really surprising amount of interesting 
mathematics seems, at present, to be involved in the problem. 

In order to see how a set of satisfactory loop addresses can always be 
constructed, let us analyze the previous example in more detail. The first 
column of the solution 1 is: 


A= 


LQ 


We see that A, C, and # have the value 1 at this coordinate while B, D, 
and F have the value 0. Thus, this coordinate will contribute a 1 to the 
Hamming distance from any of ACE to any of BDF. We may denote 
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this as ACH X BDF. Therefore, the first column makes the following 
contribution to the overall distance matrix. 


A B C D E F 





AjO 1 0 1 #0 1 
B/1 0 1 OO 1 0 
c}o 1 0 1 +0 1 
Dits i “I oO Sh. 20 
E/O 1 0 1 0 1 
Poul 0: F. 30: -4o °@ 


The second column may be written as ACF X BDE and contributes the 
following to the distance matrix. 


A B C OD E F 
Ae: a Oe ee. 
Bil 0 1 0 0 1 
clo 1 0 41 1 +0 
Dil 0 1 0 0 1 
Ej 1 0 1 0 0 1 
F|0 1 0 1 1 0 


The first two columns (i.e., coordinates) then contribute the sum of the 
previous matrices to the distance matrix. 


A B C D E F 


baw 


es . 
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The third column is: 


It will contribute 1 between A or B and D or F. Since C and E have the 
third coordinate value d, it cannot contribute to the Hamming distance 
from C or E to any other point. We can write AB X DF and obtain the 
following contribution to the distance matrix: 


A B C D E F 


Sy Qmw b 





The first three columns (coordinates) thus contribute the following to the 
distance matrix: 


A B C D EK F 





A 3 1 
B 1 1 
C 2 : 
D 0 1 
E 1 0 
F 1 2 
The last two columns are A X-C and D X E respectively. If the cor- 
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responding 1’s are added to the distance matrix, we obtain the matrix of 
our example. We see, therefore, that we can think of the distance matrix 
for our example as generated by the sum of the products ACH X BDF, 
ACF X BDE, AB X DF, A X C, D X E. Notice that in each product we 
assign a 0 to each element of one multiplier, a 1 to each element of the 
other, and ad to any possible multiplier which does not occur. Which set 
you make 0 and which set you make 1 does not matter. If we carry this 
out we obtain the coordinates for A through F given previously. 

The same mathematics works in general. Take the system of loops for 
which we wish to find an addressing scheme, and find the abstract graph 
in which each loop represents a vertex and two vertices are connected if 
and only if the loops touch. Now write down the (symmetric) distance 
matrix for this graph. If the vertices of the graph are A,, Ao, --- , A,, 
and the Hamming distance between A; and A; is d;;, then we may take 
d;; copies of A; X A; and then sum over all 7 and j. The contribution to 
the address of each A, will be d;; coordinates 1 to A;,0 to A;, and d to all 
other vertices. Therefore, the total contribution to the distance matrix 
will be d,;; in the (z, 7) position, and 0 everywhere else. Thus, the re- 
sulting complete set of coordinates for the A, will consist of 0’s, 1’s, 
and a’s calculated from each necessary copy of each A; X A; and will 
produce the desired distance matrix. 

This proves that the addressing scheme is always possible, but we have 
used a ridiculously large number of coordinates, perhaps 


sn(n — 1) 
2 


where s is the largest point-to-point distance in the distance matrix. We 
can save a factor n/2 if we take 


A; X (Ag, As, °°+ , An) 
+ A, x (Aj, ) A,, aaa ; Ay.) se Ay x (A,;, OS jAay) Gana 
where A;,, °°: A;:,, are all those vertices for which d,,;, 2 2, A;,,°°- 


A;,, are all those vertices for which d,,;, 2 3, etc. We then repeat 
for A, X (A;, -:- A,) -+- , and so on up to d,_;,,, copies of A,_, X A, . 
This time we have at most s(n — 1) products, and therefore have 
found a set of at most s(n — 1) coordinates for the A; such that loop 


addressing will work in the desired way. We have proved: 


Theorem 1: Gwen any system of n loops so that the maximum distance 
between any two loops ts s, a system of addresses such that every minimal 
path between loops 1s obtained by switching to an adjacent loop tf and only tf 
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the Hamming distance to the destination 1s decreased by 1 can always be 
found. The length of each address can be taken to be no more than s(n — 1). 

Let us remark right away that we believe the right answer to be 
(n — 1) rather than s(n — 1). We have no proof and we have no counter- 
examples. We will, however, prove the following theorems in the sequel. 


Theorem 2: If the abstract graph of the loop system is the complete graph 
on n vertices, then addresses of length (n — 1) are best possible. 


Theorem 8: If the abstract graph of the loop system 1s a tree on n vertices, 
addresses of length (n — 1) are best possible. 


Theorem 4: If the abstract graph of the loop system is a cycle of length n, 
then addresses of length n/2 are best possible if n is even, and addresses 
of length (n — 1) are best possible if n ts odd. 


II, MATHEMATICAL DEVELOPMENT 


Let us summarize what we have proven so far. Let (d;;) be the distance 
matrix of the abstract graph G with vertices A,. Let 


= 


(G) 


(Aj... ae Aiaua) x (Ap: ons Ay <') (1) 
a=1 

represent the graph G in the sense that A; and A; appear on opposite 
sides of products exactly d;; times. The number of coordinates which we 
must assign to each vertex of G is the minimum of V(G) over all de- 
compositions that satisfy the above conditions. 

The problem is equivalent to a problem in quadratic forms. Write 

> a; ;U,2; 
1sisign 


N 

= 2d, (ia beet Htin Sia Hott tia.) (2) 
Since d;; = 0, no single x, can appear in both factors of any single 
product. The equivalence is immediate since either decomposition will 
immediately yield the other. Our problem then is to find the minimum 
number N for any given quadratic form whose coefficients d;; are the 
distance matrix of a graph. We shall prove the following lemma due to 
H. 8. Witsenhausen. 


Lemma 1: Let n,, n_ be respectively the number of strictly positive and 
strictly negative eignevalues of the distance matrix (d;;). Then 


N = max (n,,n_). 
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Proof: Let Q(a1, «+: , %) denote the quadratic form Doicicien CijLsXj. 
As we have seen, the existence of a length k addressing of G is equivalent 
to the existence of a decomposition of Q into the sum of k products of the 
form (z,, + ++: + .2;,)(7;, + +--+ + 2%;,). But we see that 


k 


Q ae ys (in. + Soe + eee | Clam + eens + Die a) 
us 
1< 2 
= 4 Oe Liu, 1 7 + Vine cu) + Uiuis + eee, +- BS « Jess) 
u=1 
(Lins + oe af Vivir cu) = Live =—S 4 = Pee teal 


so that we have represented Q as a sum of k squares minus another sum of 
k squares. However, it is an easy consequence of the theory of quadratic 
forms (cf. Ref. 2) that for any representation of Q as a sum of p squares 
minus a sum of g squares, we must have 


p = index Q = n,, 


q 2 rank Q — index Q = 


examples equality seems to hold in the above lemma. However, most 
unfortunately, in general N # max (n,, n_). For the graph given in 
Fig. 7, we have n. = 1, n- = 5, but a computer search of possible 
decompositions has shown N = 


Therefore, k 2 max (n,, n_) and the lemma is proved. For most simple 


Lemma 1 is strong enough to settle the best N in many cases. In 
preparation let us prove Lemma 2, due to E. N. Gilbert. 


Lemma 2: If then X n distance matrix (d;;) 1s cyclic (meaning d;; = 
a(j — 2) mod n), then the eigenvalues of (d;;) are the values of 


n-1 
P@ = Laz 
0 


at each nth root of unity. 


Fig. 7—Graph for Lemma 1. 
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Proof of Lemma 2: Let the cyclic matrix M be 


Ao A °°" An-1 
M a Qn-1 Qo AN Qn-2 7 
ay a2 Ao 


If we try an eigenvector of the form 


I 
2 
‘ ) 
re 
then 
1 to + ae + -+* + a2" 
| ® | = |= Se a er 
go Qy + age + eee + age” 
If 2” = 1, then the latter matrix equals 
1 
(ay + ae + +++ + a,-2"')| * 
n—1 
Z 
Thus the values of ay + aiz + +++ + @,-12” if 2” = 1 are eigenvalues of 


the matrix M. Since they are n in number, and since / has only n 
eigenvalues, they are all the eigenvalues of M. 

Theorems 2, 3, and 4 may now be proved by using these lemmas. Let 
us prove a statement equivalent to: 


Theorem 2: If G ts the complete graph* on n vertices, then N(G) = n — 1. 


Proof: For this graph, d;; = 1,1 S$ 1 < j S n. The corresponding 
quadratic form is 
n-1 
2 Vil; y 
l1si<jsn 


*i.e., any two vertices of G are joined by an edge. 
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which is equal to 
n-1 


Dy Bi(@inr +--+ + 2). 


t= 


Hence N(G) S n — 1. To obtain an inequality in the opposite direction, 
we examine the eigenvalues of the (d;;) matrix. By Lemma 2, they are 
the values of 


P@=aze+2/4+--- 427" 
when 2"=1. But P(z)=[e(2""’—1)/(z—)], so that if z"=1 and 21, 
P(z) = —1.Ifz2 = 1, P() = (n — 1). Hencen, = 1, n_ = (n — 1), and, 
by Lemma 1, N(G) = n — 1. Hence N(G) = n — 1. 


Theorem 3: If the graph G is a tree* with n vertices, then N(G) = n — 1. 


Proof: We first examine the distance matrix D, for a tree with n vertices. 
Consider a terminal vertex v,, 1.e., a vertex which is distance 1 from 
just one other vertex, say v;. By a suitable relabeling we can assume2z = n 
and j = n — 1.* Thus, d,,, = 14+ d,-.,,forl1 S k S$ n — 1. Hence, 
the matrix D, has the form 





0 Gy: S88 dia) TE Gc] 

dis O te dana 1+ dons 
Dea=loi | a 
dines 0 1 

ie er ee ae if 0 | 


We wish to evaluate the determinant det (D,) of D, . Certainly we 
can subtract column n — 1 from column n and row n — 1 from row n 
of D, without changing det (D,). This leaves us with a matrix D, 
with the form 


O eee din-1 1 
1 
Di = 
Cissy Bei iar 1 
1 see 1 —2| 


*i.e., G is connected and has no cycles. 

t We have chosen 7 = n — 1 to simplify the exposition of the first part of the 
proof. In fact, any 7, 1 S 7 S n — 1, is acceptable. This generality is required later 
in the proof. 
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But, we now imagine removing the vertex v, from G, forming a tree 
G,-1 with n — 1 vertices. The interpoint distances in G, are given 
exactly by the upper-left (n — 1)-by-(n — 1) submatrix of D/ . As 
before, we can suitably relabel the vertices of G,_,; so that v,_, is a 
terminal vertex adjacent only to v,-. . The corresponding rearranged 
matrix D’’ now has the form 


0 dy» ans din—2 1 13 dim 1 


is O +++) donee I + don-o 1 
eS 7 
iss lets 0 . 1 

1 + djn-2 tee 1 0 1 

1 Yai 1 1 —92 


By subtracting column n — 2 from column n — 1 and row n — 2 from 
row n — 1 we obtain 


0 die : ; din-2 1 1 
diz 0 aes hee 1 1 


Div =|: : 
Ain-2 don-2 ++: O 1 1 

1 1 nee 1 —~2 0 

1 1 Sie, - eb 0 -2 


It is not difficult to see that this process can be continued until we 
reach the matrix 





fo 2 1 1 i 4 
1-2 0 0 0 0 
i 20) 222 30 0 0 
D¥=/1 O O —2 0 0 
i 0 <0: 0 ee 2a <0 
[1 0 0 O -:-:: 0 -—2 


The first (surprising) conclusion we draw is that det (D,) depends 
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only on the number of vertices n and not on the structure of the tree G. 
By expanding det (D*) along the last column it is easy to get the 
recurrence 


D, = (ly go as 2D,-1 ) D, = Q, D, =— S52 
from which it follows that 
= (-1)"(m — 1927, n2l. 


We next note that if we relabel the vertices of G, according to the 
relabeling used to get the matrix D*, in the corresponding distance 
matrix D,, (which is a permutation of the original distance matrix D,) 
the upper left-hand k-by-k submatrix D, of D, is just the distance 
matrix for some k vertex subtree of G. Hence, 


det (D,) = (-1)**(K — 1)2'?, k= 1. 
Finally, the sequence of determinants 
1, det (D,), det (D,), «++ , det (D,) (3) 
is just 
1,0, —1, 4, —12, 82, --- , (-—1)""'(m — 1)2””. 


Hence, the number of permanences of sign of this sequence (where 0 
is fixed as either positive or negative) is just one! By a theorem in 
matrix theory (cf. Ref. 2), the number of permanences in sign of the 
sequence (3) is exactly the number of positive eigenvalues of D, which 
we have seen is just one. Since D, , 18 nonsingular for n 2 1, then D, 
has no zero eigenvalues and hence, D,, must have n — 1 negative eigen- 
values. Therefore D, also has n — 1 negative eigenvalues and by 
Lemma 1, N(G) 2 n — 1. . 

The construction which gives N(G) S n — 1 has an easy recursive 
definition: Each time we choose the next vertex v; in the tree to assign an 
address to,* make sure that it is adjacent to a vertex v; which is already 
addressed, and let A(v;) — A(v;) 1 and A(v,) — A(v;) 0 for the previously 
addressed vertices (i.e., 1 and 0 are adjoined to the previous addresses). 
Thus, after all vertices have been addressed, all addresses will have 
length n — 1 and, in fact, no d’s are used. Therefore, N(G) = n — 1 
and the theorem is proved. 


Theorem 4: If G is a cycle on n vertices, then N(G) = n/2 if n is even and 
(n — 1) tf nts odd. 


* Where we assign 0 to the first vertex and 1 to the second vertex. 
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Proof: If n = 2m, then the vertices A, --+ Az», can be coordinatized as 
follows: 


IIA 


= m-+ 1, 


A,=(1-:-1,0---0) if 1Ss 
Cans 


and 
A,=(0::-0,1::-1) if m+28s8 8 2m. 
—O 
s—m—l1 2m—s4+1 


Clearly d;; = min ({z — 4], 2m — |z — j|) is the number of places in which 
A; and A; differ, and is the correct distance on a cycle. Hence N(G) S m. 
On the other hand, d,,,4; = m, and hence A, and A,,,, must differ in 
exactly m coordinates. Therefore there must be at least m coordinates, 
and hence N(G) = m. Thus N(G) = 
If n = 2m + 1, consider the following addresses: 
2m 


Ce ee 
m m 
freee ee 
A, —000---00---000 
A, —d00---00---001 


A, —dd0---00---O11 


Am, —ddd-+-d1-+-111 
A m+i7ldd: + -d1-+-+110 
An+2—dld---d1---100 
Am+3——dd1---d1---000 


Agm —ddd---10---000 


We see that: 
() fOSisjSmdij=j-% 
Gan es =) on, dig = 7-1; 


(2) fOS7t1Sm and j = m +s where s > 0, then consider sepa- 
rately 7 > s andi S s. If 7 > s then the first m coordinates 
contribute 0 and the second m contribute 7 — 7. If 7 S s the 
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first m coordinates contribute 1 and the second m contribute 
t+ 2m — j, so that together they give 2m + 1 — j + 7 which 
is the correct cyclic distance. 


We thus know that N(G) S$ 2m 
To prove N(G) 2 2m, we use Lemmas 1 and 2. 


P(@) = z+ 227 + +++ + me” + m2"? + (am — Ie"? --- 2, 


and we consider z such that 2?”** = 1. 


If z, = exp (2rik/2m + 1), then 


_ ~ Qyrk = 
PQ) = 2 2, j cosa » k = 0, 1, 2, , 2m. 


P(1) > 0; we shall prove P(z,) < 0 for all other k. We find that if we 
define 





Q4rx 
g(e) = Dy sin ES, 





then 
OS PonlLaees — COS 7X 
@) =} | 2m+1 __* 
g 2 is Tx 
2m+ 1 
Therefore 
goes tee eT Lae eee See x co ae 
; at I2m+1 mere Die | Dipset OMe 
g'(%) = 5 
in? oe 
2m+ 1 


is +/(2m + 1) times the desired series if x is 1, 2, --- 2m. But g’(z) < Oat 
all of these points. Hence n, = 1,n_ = 2m, and N(G) = 2m by Lemma 1. 
The theorem is proved. 


ItI. ADDRESSES OF MINIMUM LENGTH 


We describe an algorithm which is guaranteed to produce a valid 
addressing for any graph G. This algorithm has always succeeded in 
finding an addressing of length < n — 1 for every graph G on 7 vertices 
to which it has been applied. However, no proof that this will always 
happen is currently known. 


The algorithm proceeds as follows: 
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(<) Number the n vertices of G with integers {1, 2, --- , n} so that for 
k > 1, the vertex numbered k is adjacent to some vertex with a smaller 
number. Since G is connected, this is always possible. Let v(k) denote the 
vertex to which k has been assigned. 

(272) Assign the (partial) addresses of 0 to v(1) and 1 to v(2). 

(it) In general, suppose we have assigned (partial) addresses to 
v(1), v(2), --- , v(k), say, A(z) has been assigned to v(1), so that d;; = 
dy(A(2), A(¥j)), 1 S t <7 Sk, where dy, denotes the Hamming distance 
and d,;; denotes the distance between v(z) and v(j) in G. We next search 
for an address A(k + 1)(of the same length as the A (z)) with the property 
that maxicicr (dij,ns1 — dy(AQ@), ACK + 1))) = miss is as small as 
possible under the constraint 

min (d; 441 — dy(A(i), A(k + 1))) 2 0. (*) 


1sisk 


Of course, we can always find some address which satisfies (*), namely 
the all d’s address. Typically we can choose A(k + 1) so that m4, = 1. 
In fact, it is usually possible to do this by choosing A(k + 1) to be a 
slightly perturbed copy of some A(/) where v(/) is adjacent to v(k + 1). 
This is intuitively reasonable since in this case |d;,4.1 — d;,,| S 1. 

After A(k + 1) has been chosen, we then adjoin m,.; symbols to each 
of the partial addresses A(z), 1 S i S$ k + 1, as follows. To A(k + 1) we 
adjoin m,41 1’s. To A(z) we adjoin mz21 — (di,n41 — de(A(2), A(k + 1))) 
d’s and d; x41 — dy(A(2), A(k + 1)) 0’s. It is easy to check that for the 
new augmented addresses A’(z), 1 S 1 S$ k + 1, we have 


di, = dy(A’(2), A’(j)), 1 Ss a < j Ss k oF 1. 


We continue in this manner until the addressing is completed. By 
construction, the terminal addresses will form a valid addressing for G of 
length 1 + m3 + -+: +, 

As an example, we construct an addressing for the graph in Section I 
by this process. In Fig. 8 we show this graph with a particular ‘“‘adjacent- 
numbering” chosen and also the distance matrix for the graph. 

We start with 


vertex address 
1—0 
2—1 


Adjoining vertex 3, we see that any partial address of length one will 
give m; = 1. We choose 0. 
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Fig. 8—Addressing example. 


vertex address 
1—0 
2—1 
3—0 
We next adjoin ms; = 1 1’s to A(8) and augment A(1) and A(2) ac- 
cordingly. 
vertex address 
1—00 
2—1d 
3—01 


Now adjoin vertex 4, choose partial address 01, calculate that m, = 1, 
and augment the partial addresses accordingly. 


vertex address 


1—000 
2—1d0 
3—010 
4—011 
Continue this for two more steps. Each time m, = 1. 
vertex address vertex address 
1—0000 1—0000d 
2—1d0d 2—1d0dd 
3—0100 3—0100d 
4—0110 4—01100 
5—0111 5—O11ld 


6—ld1d1 
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The last array gives a length 5 addressing for G. Of course, different 
partial addresses or a different initial vertex numbering will result in 
different addressings for G. 

As we have previously stated, we have no general proof that 
N(G) Sn — 1 in all cases although a number of partial results in that 
direction have been given as well as a heuristic construction. 


IV. ADDRESSING IN RESTRICTED LOOP SYSTEMS 


The addressing scheme we have been describing has the very great 
power of being able to handle an arbitrary configuration of loops, and to 
provide alternate routing in an optimal way without any supervisory 
memory. The price we have paid for this generality is in the length of the 
address—typically n — 1 ‘‘bits’’ for n loops in the simplest encoding— 
and in possible complications under system growth. It is clear that if a 
new loop is added which greatly shortens the distance between many 
pairs of loops, then many addresses may change a good deal. There 
would be various ways of handling this, but it is obviously a problem. 
It arises essentially because the numbering in its full generality is not 
hierarchical. 

Typical Bell System loop configurations, as we noted in the intro- 
duction, will not be arbitrary collections of loops, but will have a 
hierarchical structure. 

By correspondingly restricting the allowable adjacency graphs G, it is 
possible to modify the routing algorithm and effectively take advantage 
of a natural “product”’ construction, as pointed out by J. R. Pierce." In 
this system, as we saw, loops are partitioned into three classes—national, 
regional, and local. The address portion of the message is subdivided 
into three corresponding portions. The routing algorithm now consists of 
three steps: (2) First apply the previous Hamming distance algorithm 
to the ‘‘national”’ portions of the sending and the destination addresses; 
(22) When the distance in 7 becomes zero, then apply the Hamming 
distance algorithm to the “regional” portions of the addresses; (iz?) 
Finally, when the distance in 727 is zero, apply the Hamming distance 
algorithm to the “local” portions of the address. 

This scheme combines the efficiency of the Hamming distance al- 
gorithm with the savings in address lengths resulting from the hierar- 
chical structure. As an example, the network in Fig. 9 has 44 local 
vertices. For a direct Hamming algorithm addressing we should expect 
addresses to have length of around 59. By distinguishing national, 
regional, and local loops (capital letters, lower case letters, and integers 
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235 «24 
22Q S 
W h O25 
26 
O) 
310 si 
300 
| > 28 
O 
29 
4D 
41Q *. 
f 2 3890 040 . Nas 
36 O 
35 39 feo 055 


Fig. 9—Network example. 


respectively), with a small additional computed cost in routing (several 
extra conditional transfers) we can have addresses of length < 11. 

For example, let NV, R, L denote national, regional, local, respectively. 
One possible addressing begins: 


N Rk L 


A—(00, 000 --—) 
B—(01, 000, ---) 
C—(10, 000, ---) 
D—(11, 000, ---) 
a—(00, 001, 000) 
b—(00, 010, 000) 
c—(00, 1d0, 000) 
d—(01, 001, 000) 
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1—(01, 001, 001) 
2—(01, 001, 010) 
3—(01, 001, 1d0) 


44—(11, 000, 001) 


Micreavan - add additional local stations to a regional station it is a 
very simple matter to modify just the neighboring local addresses to 
obtain a correct addressing for the augmented network. - 

The restriction on local loops in the above addressing is that each one 
must interchange directly with one and only one regional loop. If a local 
loop meets no regional loop directly, but only other local loops, then the 
addressing must make special provision for routing calls to other regions 
properly. If a local loop meets more than one regional loop—really a 
violation of the hierarchical concept—then routing becomes more 
difficult, and must assure that a call to a different region exits the local 
loop properly. As J. R. Pierce has pointed out,’ a special trunk loop 
connecting a local loop in one region to a local loop in another (i.e., a 
preferred alternate route in a special case to the national loop) is no 
problem. The exit from the local loop is just before the regional inter- 
change, and the entrance to the local loop just after. Exit is made only if 
- the total loop address matches exactly. Alternate routes more generally 
are perhaps most easily provided by duplicating portions of regional or 
natural loops. 


V. SOME VARIANTS OF THE ADDRESSING PROBLEM 


The purpose of this section is to record very briefly some other 
alternatives that have been considered. 

(i) We have required that in every alternate route between the ieobs; 
Hamming distance decrease by exactly 1 at each transfer. One could 
consider the alternate problem in which any exit which decreases 
Hamming distance is valid—even if it decreases it by more than 1. Under 
special conditions, this can lead to shorter addresses, but we do not 
have a solution for this alternate problem. 

(zz) Since the introduction of d’s causes some complication of the 
address codes, it is interesting to consider the possibility of getting rid of 
them. They arose originally because of the need for odd cycles, as in the 
case of a 3-cycle. One way out of this example would be to double all the 
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distances. If these vertices were located at 000, 110, and 101 respectively, 
the Hamming distance between any pair is two, and correct routing 
would be possible without any d’s in the addresses. 

Unfortunately, this technique of doubling all the distances to get rid 
of d’s does not generalize. Consider the graph in Fig. 10a. We double all 


Cc D Cc D E 


A OB A B 
Fig. 10—Graphs to illustrate one variant of the addressing problem. 


distances, so that AB = AC = AD = BC = BD = 2,CD = 4. Then 
A =00---,B=11--- , where the coordinates are identical from the 
third onward. Now C must differ from each of A and B by 2. It therefore 
must differ in one of the first two columns, and in one other, say the 
third. Thus, we may assume A = 000---,B=110---,C = 101 ---, 
where the coordinates are identical from the fourth onward. D must also 
differ in exactly two places from A and B and in four places from C. 
Hence A = 0000 ---, B = 1100:--,C = 1010---,D = 0101 -- 

So far so good. If we now require yet another point # (Fig. 10b) such 
that HA = EB = 2, EC = ED = 4, we have no possible coordinates for 
E left. E’s address must begin with 01 or 10 in order to differ from A 
and B by equal amounts, say with 10. To differ from C and D by equal 
amounts the first four coordinates must be 1001. But it now differs 
from A and B by 2 and from C and D by 2; no additional coordinates can 
make EC = ED = 4 without destroying EA = EB = 2. Thus-doubling 
distances will not get rid of d’s. 

Similar arguments show that even if we are allowed to multiply all 
distances by a fixed number m > 2, we still cannot get along without d’s. 
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Heavy Traffic Characteristics of 
a Circular Data Network 


By B. AVI-ITZHAK 
(Manuscript received April 26, 1971) 


Traffic behavior in the Pierce loop for data transmission is studied 
under assumptions of heavy loading. A deterministic mathematical model 
for describing traffic flows 1s developed and analyzed. The mathematical 
problem is of a linear complementarity type which has not been dealt 
with in the literature of mathematical programming. An effective procedure, 
the load-and-shift algorithm, for determining traffic flows is proposed. 
The procedure yields all feasible solutions for traffic flows and reveals 
the possibility of stations grouping into dominating classes and preventing 
other stations from using the system. This property, which can be elaminated 
by exercising appropriate control, also may affect the stochastic behavior 
of the system when heavy traffic conditions do not prevail and therefore 
deserves careful investigation. The paper includes two numerical examples 
illustrating use of the load-and-shift algorithm and numerical results 
from a simulation showing some of the effects of dominating classes when 
heavy traffic conditions do not prevail. 


I. INTRODUCTION 


The concept of a loop network for data transmission has been proposed 
recently by J. R. Pierce.’ In such a network the stations are connected 
to a closed loop main line on which one-way traffic is allowed. A message 
to be delivered from one station to another is arranged, at the sending 
station, into standard packets each carrying the address of the receiving 
station. These packets are then delivered onto the main line, one at a 
time, where they flow around in the allowed traffic direction. The address 
of each packet is checked at each station on the way until it reaches 
the receiving station where it is removed from the main line. Traffic 
on the main line cannot be delayed; therefore, a station can deliver 
a packet onto the main circular line only when permitted by the existence 
of a gap in traffic or when receiving a packet from the main line. Principal 
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features of the system may be explained with the aid of the four- 
station network shown schematically in Fig. 1. 

The four-armed structure revolves around the central axis and stops 
briefly every time the four packet-carrying compartments at the ends 
of the arms are aligned with the four stations. During such a stop 
each station is able to check the content of the aligned compartment. 
If the compartment is empty, the station can load it with a packet. 
If there is a packet in the compartment, it will not be removed unless 
it is addressed to the said station, in which case the station is permitted 
to load the compartment again after unloading it. A Pierce loop can 
be represented by the mechanical analog structure shown in Fig. 1; 
however, the number of revolving arms in the structure is not neces- 
sarily equal to the number of stations in the loop. Rather, the number 
of arms is determined by the “loop time” of the system (the time needed 
for a bit to complete one round on the loop). Significance of the loop 
time is discussed in more detail in Section V. 

In the Pierce loop it is currently assumed that no outside control 
is applied and each station strives to send its messages at the earliest. 
Therefore, a station will never miss the opportunity to load a compart- 
ment unless there are no messages waiting for delivery at the station. 

This paper presents a study of the flow characteristics of such a 
system in heavy traffic, i.e., when the system is not able to deliver all 
messages, and infinite queues build up at some stations. In Section II 
the notion of ‘‘stable solutions’ for the traffic flows is introduced and 


STATION 


3 
STATION STATION 
p /) i 


STATION 
1 


Fig. 1—A schematic description of a four-station Pierce loop. 
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formulated mathematically. The case of a totally saturated system is 
analyzed in Section III where the basis is laid for the mathematical 
development of the load-and-shift algorithm given in Section IV. A 
detailed description of a Pierce loop for data transmission is given in 
Section V where the alternating priorities effect due to dominating 
classes of stations is studied by simulation. This section also discusses 
the important aspect of the order of stations in the loop. Readers not 
interested in the mathematical elaborations may skip Section IV. 


II. MATHEMATICAL FORMULATION 


We assume that the flow direction on the main circular line is counter- 
clockwise. There are n stations, connected to the main line, numbered 
from 1 to n in counterclockwise increasing order. The segment of main 
line between the zth and the (2 + 1)th station, 2 = 1, 2, --- ,n — 1, 
is called the 7th branch. Similarly, the nth branch is the segment be- 
tween station n and station 1. Let p;; be the proportion of flow (packets) 
emerging from station 7 and destined for station j. 


dpi = 1. (1) 


The n X n square matrix P = {p;;} possesses all the properties of a 
stochastic matrix. Note, however, that the elements of P are not 
necessarily probabilities. 

The demand at station 7 is given by \; which is the average amount 
of flow (packets per time unit) generated at the station. The capacity 
of each branch equals 1, that is, each branch is capable of carrying 
a maximum flow of one unit. In the schematic description given in 
Fig. 1 assume that one full revolution takes four time units (in general 
it will take the number of time units equal to the number of arms) 
The capacity of each branch will then be one packet per time unit. 

The average flow emerging from station 7 will be denoted by 2; 
and the average flow in branch 7 by p; . Clearly 


OS 27; 52,, = 1D, on (2) 
and 
Gp i: SSO ees ow. (3) 


Let a;; be the proportion of flow emerging from station j and flowing 
through branch 72, then 
ai = pa Pik » (4) 


keSij 
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where 


yl ee if i>% 


Note that a;_:;—a@;;=p;; 2 0 for all 7 ¥ 7 where 7—1 is defined to equal 
n whenz = 1. For7 = 7 we havea,; = 1 and a,_-,; = p,;; S 1. In most 
reasonable applications p;; = 0. 

The average flow in branch 7 may now be expressed as a linear func- 
tion of X = (%,2%2,°°* , 2). 


pi DD a;%; S 1. (6) 
= 


Every X which is a feasible solution for the average flows must satisfy 
relations (2) and (6). The set of all feasible solutions is therefore con- 
tained in a convex polyhedral set. A central control could select a 
particular solution from this set to suit a given objective. 

In the circular network suggested by Pierce, however, there is no 
central control. Rather each station is striving to maximize its own 
flow onto the main line. For this case we define a stable solution as 
a solution from which the system will not depart without outside 
intervention. Suppose then that X* is a stable solution. Clearly X* 
must satisfy relations (2) and (6) and the additional condition that if 
pi = dot a;;e% < 1 then x* = i; . To show that this is a necessary 
condition assume that p; < 1 and x* < }, . However, station 7 strives 
to maximize its flow and can increase it as long as x* < i, and p; < 1. 
Therefore, it is not possible that p; < 1 and «* < i, . This additional 
condition is not generally sufficient for assuring that 2* is a stable 
solution. 

Insufficiency is best demonstrated by a simple numerical example. 
Suppose n = 4, \; = Ap = As = Aq = 2 and 
0 0 


0 
0 
1 





1 
1 0 0 
0 0 0 
0 0 1 0 
It is easy to verify that X° = (0, 0, 1, 1) and X* = (1, 1, 0, 0) both 
satisfy relations (2), (6), and the additional necessary condition as 
does X = aX° + (1 — a)X* for0 S a S 1. For simplicity we assume 
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that the flows generated at the stations are deterministic in nature 
and each station generates exactly two packets each time unit. 

Suppose now that the system (see Fig. 1) has one rotating arm only. 
In such a case there are only two stable solutions, namely X°, (a = 1) 
and X*, (a = 0). If the system has two rotating arms, an additional 
stable solution, a = 4, is added. If the system has k arms, there exist 
k + 1 stable solutions, a = m/k, m = 0,1, --- , k. The system shall 
settle for the stable solution a = m/k if at time zero m compartments 
contain packets from stations 3 and 4 and k — m compartments contain 
packets from stations 1 and 2. The necessary condition for stability 
is also a sufficient condition if the flows generated by the stations 
are continuous (as in the case of nonmixable fluids or small particles 
such as vehicles and a loop consisting of a pipe or a road). In such a 
case the set of feasible solutions is identical to the convex polyhedral 
set given by relations (2) and (6). _ 

We wish to find all solutions which satisfy relations (2), (6), and 
the necessary stability condition (these will be all the stable solutions 
in the case of a system with continuous flows). Our problem can be 
redefined as one of finding all feasible solutions to the following set of 
equations: 


x; + U; = r; ) 
n 
Dd, 4:52; +2, = 1, 2=1,2, 01+ Mn, (7) 
g=1 
uz, = 0, x; 2 0, Ui), 2, 2 0 


This form resembles a linear complementarity problem,” where u; and 
z, are slack variables. A feasible solution to the set of equations (7) 
is a basic feasible solution to the set of equations (2) and (6) since at 
least 7 of the variables must equal zero. 


III. COMPLETE SATURATION 


For sufficiently large values of \; (for example \; = 1,7 = 1, 2,--- ,n), 
all the branches are saturated and p; = 1,7 = 1, 2, --- , n. The set 
of equations (7) takes the form 


> a;%; = 1, ¢=1,2,-+-,n. (8) 
j=l 


This can be shown to be equivalent to the set of (n + 1) linear equations. 
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oo Die; = 0, t= Dare »n, (9a) 
i=] 
>» Xj d; = b. (9b) 
j=1 


The ith equation in (9a) is obtainable by subtracting the 7th equation 
of set (8) from the (¢ + 1)th equation of set (8). Equation (9b) may 
be selected as any linear combination of equations (8). Note that one 
equation in (9a) is redundent since >>"_, p:; = 1 for all 7. 


Remark: The physical interpretation of equations (9) is that when 
in complete saturation a station is able to deliver only when receiving. 
The flow emerging from a given station must equal the flow entering 
the station from the main line. This equilibrium relation is expressed 
by the n equations of set (9a). The (n + 1)th equation, (9b), expresses 
the capacity limitation of the branches. If we select d; = yo) de 
we have 


2D xz; d; =n, (10) 


where d; is the average number of branches (distance) traveled by a 
packet emerging from station 7. Equation (10) states that the average 
work (in terms of packets times distance) demanded from the system 
per time unit must equal n, since all branches are saturated and each 
traverses one packet per time unit. In matrix form we have 


x'p — oe Ge 
2d xz; da; = 0, (11) 


where X” is the transpose of Xx, (note that all vectors are defined to 
be column vectors). 

P is a stochastic matrix and, therefore, the problem represented 
by equations (11) strongly resembles one of determining the steady 
state probabilities of a finite state space Markov chain. The difference 
is that in the Markov chain problem d; = 6 for all 7 while in our problem 
this is not necessarily so. In the following we shall make use of this 
resemblance. 


Definitions: 


(zt) S; shall be used to abbreviate ‘“‘station 7.” 
(iz) S; is said to be accessible from S; , S; — S;, , if there exists a 
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sequence of elements of P such that P;.,P..2, °++ Pr,_.:,P:,; > 0. 
In such a case it is also customary to say that S; leads to S; . 

(iz) If S; — S; and S; — S; both stations are said to communicate 
(S; — S,). Clearly if S; = S,; and 8S, S; then 8; 8S, . 

(iv) Let C(z) = {S8,; : 8; S;}. Clearly if 8S; e C(2) then C(j) = C(). 

(v) A nonempty class of stations, C, is called a communicating 

class if for some station S; e C, C = C(z). It follows that two 
communicating classes are either identical or disjoint. 

(vt) A communicating class is closed if no station outside the class 
is accessible from a station in the class. 

(vit) A closed communicating class shall be called a dominating class 
or a class of dominating stations. A station not belonging to any 
dominating class is a dominated station. 


From the theory of Markov chains we know that there exists at 
least one class of dominating stations in a given Pierce loop. However, 
it is possible that there will be no dominated stations. In such a case 
all the stations are dominating and may form into one or more dominat- 
ing classes. It may be shown that the number of dominating classes 
is one if and only if there exists a station accessible from all other 
stations in the loop. In the case that dominated stations exist each 
must lead to at least one dominating station. 

Each dominating class of stations is represented by a principal sub- 
matrix of P (a dominating submatrix). This submatrix is obtainable 
by deleting all rows and columns of P corresponding to stations not 
belonging to the particular dominating class. Similarly, all dominated 
stations may be represented by one submatrix of P. 


Theorem 1. Let B beak X k submatrix of P representing a dominating 
class of stations, then all vectors Y satisfying the equation, 


Y’7B = y* (11a) 


form a linear space of one dimension. Furthermore, all the elements of Y 
must have the same sign, 1.e., either all positive, or all negative, or all zero. 


Proof. Theory of finite Markov chains (e.g., Kemeny and Snell’). 


Corollary. Assume that the dominating class represented by B is C = 
{Si,, Si, °°° , Sa} then there exists a unique vector Y* = (y*,y*%,-°- , 
y*,) satisfying equation (11a) and the scaling equation 


k 
> yt d;, = d. (12) 
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Clearly Y* > 0 and any solution to equation (1la) may be obtained 
by multiplying Y* by some real number. We define a vector X = 
(v, ,%2, ++: , %,) Where x; = Oif S;¢C, and 2, = y* if i = 7; . The 
vector X is called a dominating solution to equations (11) eamespondine 
to the matrix B and the class C. 


Theorem 2. Let Q beak X k submatrix of P oe dominated 
stations, then the only solution to Y"Q = Y" is Y = 


Proof. Theory of finite Markov chains. 


Theorem 8. Suppose that the matrix P contains exactly m dominating 


submatrices B,, B., -:+ , By representing dominating classes C,, C2, 
, C,, and suppose that X; 1s the dominating solution corresponding 
to B;,7 = 1, 2, ++: , m. X ts a solution to equations (8) if and only tf 
x = o> aX ; ) 
i=1 
and 
oes (13) 
i=1 
where a;,% = 1,2, +--+, mare real numbers. 


Proof. This is an immediate result of Theorems 1 and 2. 


We conclude that in the case of complete saturation, (op; = 1,7 = 
1, 2, --- , n), there always exists a nonnegative solution to our problem. 
If there exists only one dominating class of stations in the loop, there 
exists a unique solution to equations (8). If there is more than one 
dominating class in the loop, then there exist infinitely many non- 
negative solutions obtainable as convex combinations of the dominating 
solutions. 


IV. THE LOAD-AND-SHIFT ALGORITHM 


Returning to the more general case, we describe in this section the 
load-and-shift procedure for solving the set of equations (7) when 
pi S 1,7 = 1, 2,--- , n. The algorithm is based on the results obtained 
in the foregoing analysis. Assuming that the capacity of the branches, 
e, may be varied between 0 and 1; we start with « = O and increase 
it until e = 1. While so doing we simultaneously load the system and 
obtain the feasible solutions for any given value of 0 S « < 1. 


We shall start by outlining the procedure for finding just one solution. 


LOOP SWITCHING 2529 


Start: 7: = 0,20; PO 2S Py NOS 


Step 1: Increase r by 1. In P™ find a square submatrix B“”? repre- 
senting a dominating class of stations C’”’, C’’ = {S;, , 
S:, 7 OE og Sa}! 


Step 2: Find the unique positive vector Y = {y% 
satisfying the set of equations 


cy) Tp) = (y?) T 


3 ys” d; 


To determine the value of d,,, we select an i such that S;2C™” 
and let d;,, = @:,;, - 
Enlarge Y to the form X° = (af? , af , «++ , af), 
where a$” = Oif S;¢C™ and a? = y® ifj = 7, 
Step 3 (Load): Find a number A” such that 


(r) (r) 
A“ = Min et = an ’ 
where the minimization is over all z such that S; eC”. 
If A =>1—eset A” =1—e,set N = 1, and goto “Last 
Step.’ Otherwise increase e by A“ and continue. 

Step 4 (Shift): If r = n, set N = n and go to ‘Last Step.”’ Otherwise 
construct the (n — r + 1) X (n — r+ 1) square matrix P”*” 
by adding the jth column of P™ to its (j + 1)th column (if j 
is the last column, it is added to the first one) and then deleting 
the jth column and the jth row. M7*?: = MP — AMZ, 
t= 1,2, --- , n. End of rth iteration. Go back to “Step 1.” 


(r) (r) oy 


a1 ? Yi, d saa > Via 


1 


Last Step: The solution is 
N 
Se be AMPK”, (14) 
r=1 


STOP. 


Theorem 4: The procedure described above will awe ys yield a vector 
05 X S d\inat most n iterations. 


Proof: All possible matrices P‘” are stochastic. Therefore there always 
exists at least one dominating submatrix of P‘ denoted by B”. 
Since X‘” > Oand A” =O then X¥ = 0, AX” = 0. From the 


t A procedure for determining B“ is described in the Appendix. 
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algorithm we have \ = )0™., APX® +) = X + \%. The algo- 
rithm also ensures that \‘” = 0,r = 1, 2,--- , N, and therefore X < }. 
Since P is n X n the number of iterations cannot exceed n. 


Theorem 5 (Kzxistence): X obtained by the Load-and-Shift procedure 
is a feasible solution to equations (7). 


Proof: We enlarge Y‘’ by adding zero elements corresponding to 
columns of P‘” not included in B®’. The enlarged vector, denoted 
by Y°“” is a dominating solution to 

yp = y? 


Ss y,d; = 1, (15) 


geRr 


where R, is the set of indices of columns included in P“’. These equa- 
tions are equivalent [see equations (8) and (9)] to 


» ai?y; = 1, icR, . (16) 


ieRr 
It is easily verified, by the use of equation (4), that 
as? = 4; . (17) 


The vector X‘” is obtained by adding to Y°”” zero elements correspond- 
ing to columns of P not included in P‘”’. Therefore 


A™ SY a,jaf? = AM, forall ieR,. (18) 
j=1 


From the definition of a;; , equations (4) and (5), we know that 
@;-1; 2 a; for all j except 7 = 7, (wherez — 1 = nifz = 1). If 7¢R, 


then x{” = 0. It follows that 
A? Lage? <A”, forall ig¢R,. (19) 
j=l 


Summing equations (18) and (19) with respect to r and then substitut- 
ing equation (14) we obtain (for N < n) 


n N ; 
De Q;;%; => >S A” = 1, if eh, for all T= 1, 2, eee ,N, 

7=1 r=1 

n N 

D ait; S DA” = 1, otherwise. (20) 
7=1 r=1 


From the algorithm we know that if ze R, for allr = 1, 2,--- , N then 
x; < dr, . Otherwise x; = dr; . This completes the proof for N < n. 
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In the case N = nit is possible that >.*_, A“ < 1 (nonheavy traffic). 
The theorem still holds since x; = \; ,2 = 1, 2, --- , n. In the following 
we illustrate the use of the algorithm in solving for the flows in a 5- 
station loop: 


Numerical Example 1 








0 1/72 1/72 0 O| 0.5 | 

2/3 O 1/38 O 0 0.4 

P= /1/4 3/4 0 0 0 A = | 0.5 

1/4 1/4 1/4 0 1/4 1.0 

1/44 0O 1/2 1/4 OO] 0.8 | 

Table of Results 
C-domi- 
r | nating Class| 24°” Xo") x36") x4”) x56) Aw) € 

1| S1, Se, Ss. 0.6000 | 0.7000 | 0.5444 0 0 0.5714 | 0.5714 
2| Si, S3. 0.2500 0 1.0000 0 0 0.1953 | 0.7667 
3} S1, Ss, Ss. 0.3123 0 0 1.0000 | 0.2500 | 0.2333 | 1.0000 


The procedure terminated in three iterations yielding a solution: 
x, = 0.4646 
X2 = 0.4000 = r2 
x3 = 0.5000 = dz 
24 = 0.2333 
Xs = 0.0508 


Actually the algorithm finds the value of one variable in each iteration. 
When all variables equal to their respective \,’s have been determined, 
the algorithm finds in one iteration the values of all the remaining 
variables. It is therefore advisable to test the possible solution X = 
beforehand. 

We note that in the example there is a single dominating class of 
stations in each iteration (i.e., each matrix P“ has only one dominat- 
ing submatrix B“”’). It will be shown later that if P has only one dominat- 
ing submatrix then there exists a unique solution to equations (7). 
In the general case, however, P‘”’ may have more than one dominating 
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submatrix. This may give rise to the existence of infinitely many solu- 
tions. We would now like to improve the algorithm in order to be 
able to determine all feasible solutions. 


Theorem 6: If P™ has exactly one dominating submatrix, say B“”, 
then P"*” will also have exactly one dominating submatrix. 


Proof: Suppose 7 is such that S; e C™, then S; — S; for all 7. We 
apply the shift operation from 7 to k, (adding column 7 to column k 
and then deleting row and column 7), thus creating P”*”. Clearly 
now S; — S, for all 7. However, if there exists a station accessible 
from all other stations there exists exactly one dominating class in 
the loop. Furthermore C“*” = C(k). 


Corollary: If P has one dominating submatrix so will P™, r = 1, 2, 
iN. 


Theorem 7: If P“ has m > 1 dominating submatrices, P“*" will have 
either (m — 1) or m dominating submatrices. 


Proof (outline): Let Bi”, BS”, --- , BS? be the m dominating sub- 
matrices of P“ and let C{”, C$, --- , Ck? be the corresponding 
classes of dominating stations. Q‘” is the submatrix representing 
dominated stations. We apply the shift operation from 7 to k to obtain 
Pt», Without loss of generality we assume that S; e Ci”. For S, 
one of three alternatives must be true: 


(t) S, « C{”. In this case a dominating class will be formed, con- 
taining some or all the remaining stations of Cj”. Note that S, must 
be in the newly formed dominating class since it is accessible from all 
remaining stations of C{”. Those stations of C}” which are not included 
in the newly formed dominating class turn into dominated stations. 
The matrix P‘’*” will then have m dominating submatrices, namely 
BS”, BS, --- , B&@ and a newly formed one. 

(it) S, « Ci”, | ¥ 1. In this case every station in C{” is accessible 
from any remaining station of C{”. Therefore all remaining stations 
of C{” become nondominating, and P‘’*” will have m — 1 dominating 
matrices, namely BS”, BS”, --- , B&?. 

(722) S, is nondominating. S, must lead to at least one dominating 
class. If it leads to any dominating class other than B{” then this 
case becomes the same as (72). If S, leads to B{” only, then a new 
dominating zlass is formed. This class includes all stations accessible 
from S, . Remaining stations of B\”’ which. are not accessible from S, 
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become nondominating. P“*” will have m dominating submatrices, 
namely BS”, Bs, --- , BS’, and a newly formed one. 

This completes the outline of the proof. It is important to note that 
a dominating submatrix of P“’, say B{", will be a dominating sub- 
matrix of P“*” if 8; ¢C%. 


An Outline of the Complete Version of the Load-and-Shift Algorithm 


In the procedure for finding just one solution, outlined in the pre- 
ceding, we apply one load and one shift operation in each iteration. 
In the complete version of the algorithm we need to apply several 
such operations in each iteration. To eliminate possible confusion the 
superscript denoting the iteration number will be placed at the upper 
left side. Thus, for example, ‘’P is the stochastic matrix remaining 
by the beginning of the rth iteration. 


Definitions: Suppose ‘*’P has m, dominating submatrices, ‘’B, , 
“Be, -+: , ‘Ba, , representing classes ‘°C, , “Co. , --- , ‘Cm: - 
The matrix ‘’ P was obtained from “’ P = P by executing some sequence 
of load and shift operations. In order to keep track of these operations 
we relate to ‘”B; a set “EZ; containing labels. If the label “a; ¢ ‘E; 
we know that ‘’B; cannot be obtained unless the appropriate shift 
operation is applied to “’B; . Thus, when ‘“’E; , 7 = 1, 2, «++ , m,, 
are given, the exact sequence of load and shift operations that led us 
to ‘” P is known. 

The n dimensional vector ‘\ denotes the remaining unutilized 
flows at the n stations of the loop. The m, dimensional vector Me = 
(Pe , “Pe, «+: , “en,) describes the amount of branch capacity 
utilization. Thus “’e; is the amount of branch capacity utilized by 
the sequence of load and shift operations resulting in ‘B; . 

For each ‘’B,; it is possible to determine the dominating solution 
‘X; and the quantity ‘A; in the manner described in steps 2 and 3 
of the procedure outlined previously. 

We say that ‘’B; > “B; if when applying a shift operation on the 
appropriate column of ‘”B, all remaining stations of ‘’C; lead to 
stations of ‘C,; . If ‘°C; consists of a single station then ‘’B; > ‘B; 
if the shift is from this station to a station in ‘“’C; or to a dominated 
station leading to ‘’C; . The submatrices ‘’B; , 7 = 1, 2, --- ,m,, 
divide into three types: transient, terminal, and ring members. 


Ring: The set R = {‘B;, , “B;, , ++ , “Biy} is called a ring if 
it satisfies three conditions: 
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(2) 2 Fs ety OBS sh eek as ‘ORs men Bs ; 
(ii) “"B,,,j = 1, 2, --+ , k, does not lead to any dominating 
submatrix outside of the same ring, 
(wz) 1 — ia Geer oi A;;) > 0. 


Terminal: (i) If conditions z and 2 above are satisfied while con- 
dition 727 is violated the matrices ‘"B,, , “"B,,, --: , Ba 
are called terminal matrices. 

(i) If ‘PA; = 1 — “e; then “’B, is terminal. 
(wii) If ‘“"B; is terminal and ‘“’B; > “’B, then “’B; is 
terminal too. 


Transient: ‘ B, is called transient if it does not belong to a ring 
and is not terminal. | 


There is a strong similarity between the notion of a ring and the notion 
of a dominating class. To identify rings one may use essentially the 
same technique proposed in the Appendix for identification of domi- 
nating classes. 

It is important to observe that when applying the appropriate shift 
operations to all members of R, what remains of R will form one domi- 
nating submatrix and possibly some columns and rows corresponding 
to dominated stations will be left out. This property follows from 
Theorem 7. 

The following is an outline of the algorithm: 


Start: rn: = 0, PP: = P, Pr: =), Ve = 0, MH, : = 6. 


Step i: r:=r+1. 

Find all the dominating submatrices of “’P denoted by ‘’B, , 
By, +++ , °B,, . Calculate the numerical values of ‘A; 
and ‘ X,; , corresponding to ‘”B; , “A, and ‘"e;,7 = 1,2,-:-, 
m, . Find all the rings of ‘’P denoted by ‘R, , ‘"R.,--:, 
‘R,, . Identify the terminal and transient dominating sub- 
matrices of ‘’P. If all dominating submatrices of ‘P are 
terminal go to LAST STEP. If the number of columns of 
“P equals the number of its dominating submatrices who 
form into one ring, go to LAST STEP 1. Otherwise “A; : = 0 
for all ‘BB, not belonging to a ring. 


Step 2: Execute the appropriate shift operations to all ‘’B, belonging 
to rings and obtain “*” P. Note that in “*”P there will be 
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one dominating submatrix corresponding to each ring of ‘’P 
and, in addition, all terminal and transient dominating sub- 
matrices of ‘”P will be dominating in ‘*” P. 


Step 3: 
forty aa ()y = x Ph: xX, . 
i=l 


Cath) gs a a > (ars ae ar = 1, 2, : Ne _ ky ; 


(7: ByeC)R,) 


1 , . 
a Mes (= ar where Ca. => ay 5 t= i ++ 1, k, + 2, sgt 
Mert + 


aaa 58 aes U (PH, U ms. i= 1, 2 oat goles . 


(7 :(O ByeCOR,) 


C+D = OB where BR, = "B,,1 =k, +1,k, +2, °°° 
m,.1.GO TO STEP 1. 


Last Step 1: The unique solution is X = \. STOP. 


Last Step: N: = r. 
All the solutions are given by 


N mr 
(r) (r) 
pe a Sa 2 ee (21) 
r=1 i=1 
'N Mr E 
» DS or = 1, 0 Ss a, Ss A, ) Os = 0, r = 2, 
r=1 it=1 


unless “a, = “A; forall “a;e‘E;. STOP. 
Note that it is advisable to test the possible solution X = d beforehand. 


Theorem 8:' The load-and-shift procedure will yield vectorsO S$ X SX 
in at most n iterations. Each such vector, given by equation (21), ts a 
feasible solution to set of equations (7). 


Proof: The proof is practically identical to the proofs of Theorems 
4 and 5. 


The use of the algorithm is illustrated by a 10-station numerical 
example. 
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Numerical Example 2 











Ss So S, Ss Ss Sto 
0 0 0 0 0 0 

1/2 0 0 0 0 0 
0 0 0 0 0 0 

3/44 0 0 0 0 0 
0 0 0 0 0 0 
0 0 0 1 0 0 
0 1/4 0 O 3/4 O 
0 1 0 0 0 0 
0 0 14 0 0 3/4 
0 1/4 1/4 0 17 0] 


We first observe that “’P has m, = 3 dominating submatrices 


S: S. 8S, S 
” 0 1 0 
0 0 0 1/72 
1 0 0 0 
0 1/44 0 0 
pawp_|9 34 0 1/4 
0 0 0 O 
0 0 0 0 
0 0 0 0 
0 0 0 0 
Lo 0 0 0 
0.10 
0.46 
0.20 
0.20 
y= Oy — | 0-65 
0.50 
0.45 
0.60 
0.50 
a 
So 
S, Ss 
Op ate ‘| OB, =| 1/4 
+ 8 3/4 


The calculated results for the first 
following table. 


S, Ss 

1/2 1/2 Ss Ss 
a 0 1 

0 3/4 ’ Bz = 

1/4 0 ee 


iteration are summarized in the 


Results for First Iteration 








MC, = {S1, So} Oxo 0) 0 00 0. 00): “Ay Sot OB, > OB, | Tr a= 
Wee Ss S.So) | OX: = (0 4 0 3838 0 0 0 0 0) | Ma =0.33 | OB OB, De, a0 
MC, = {Se OxX¥;=(0 00 001 0 1 0 0) Ma; = 0.50 MB, > OB; Me = 0 
Tr—Transient; Te—Terminal. 
bed 7 -_ OBS; modes _ {Be 
sae OA a d, Es — d, Bs = d. 
Note: ‘A, will be set to equal zero before continuing. Shift opera- 
tions are performed from column 4 to 5 and from 6 to 7 in ‘’P. The 
resulting 8 X 8 matrix, ‘*”’ P, has three dominating submatrices. 
Si Sy Sy 
S, Ss; | 1/4 3/4 0 
OB = * : 1 Bs VASA: OF BIA sy OB SB, 
1 
” ie 1/2 0 
r= (0.10 0.20 0.20 0 0.37 0 0.45 0.10 0.50 1.00) 
Results for Second Iteration 
@C, = {Se, Ss} @X¥,=(0 100100 0 0 0) @A, = 0.20 2B, > @B, @e = 0.33 
Cz = {Sz So, Sio} | ®X,=(0 0 0 0 0 028 O 2438) | Ma, = 0.50 ©B,>®B, | Tel @e = 0.50 
@C3 = {81, Ss} @X¥;=(1 01000 0 0 0 0) QA; = 0.10 2B; — QB, @es = 0 





ONIHOLIMS dOot 


LEGS 


OR, = PB, BR, 
OR. = ere (2), = ae i. = d. 
Note: “A, will be set to equal zero before continuing. Shift opera- 


tions are performed from column 1 to 2 and from column 2 to 3 in “ P. 
The resulting 6 X 6 matrix, “’P, has two dominating submatrices. 


Ss 
@Bp = [1], sg oss = eB . 
“= (0 0 0.10 0 0.17 0 0.45 0.10 9.50 1.00) 
Results for Third Iteration 


@C, = {S83} OX 


1 @A, = 0.10 
MC, {S7, S9, Sio} @X> 


@A, = 0.50 


@B, > @B, 
@B, > @B, 


@e = 0.63 


Te} @e = 0.50 














say = {B}, 
sata EF = ag ; Oa, ; Past, saad 3 Se ead a 


Note: “A, will be set to equal zero before continuing. Shift opera- 
tion is performed from column 3 to 5 in “P. The resulting 5 X 5 
matrix “’P has two dominating submatrices 
Ss 
RB, = [1], B, = @ B, : 


“~=(0 0 0 0 0.17 0 0.45 0.10 0.50 1.00) 


SESS 


IL6l UAAOLOO “IVNUNOL IVOINHOGL WALSAS Tlad GHL 


OC, 
OC, 


{Ss} 
{S7, So, Sio} 


Results for Fourth (Final) Iteration 


OxX,=(0 000100 0 0 0) ®A, = 0.17 MB, > OB, Te Me = 0.73 
OX, =(0 0 0 0 0 0 38 O 34 38) MA, = 0.50 OB, > Be Te Me = 0.50 
On = oR. Qe, (0G en On = arene 


The iterative procedure terminates after completing the fourth itera- 
tion since all dominating submatrices of ‘* P are terminal. 
The set of all feasible solutions to the 10-station problem is 


(4) (4) (4) (4) (3) (3) (2) (2) (2) (2) 
Xx = ay X1 + Ae Xo + Qa) Xy + ay Xy + Q3 XxX 


(1). @) qa). @) 
+ Qe X» + a3 Xs ’ 
where 


(4) (4) (3) (2) (2) (1) (1) 
a, + Qe + a, + a + a3 + Qe + a, = 1, 


and 

0s “a $0.7, “a, = 0 unless “a, = 0.10. 

0< “a £0.50, “a. = 0 unless “a; = 0.50. 

0< “a, $0.10, “a, = 0 unless a, = 0.20 and “a; = 0.10. 
0s “a, < 0.20, “a, = 0 unless “a, = 0.33. 

0< “as S 0.10. 

0 < “a. S 0.33. 

0 < “Yas < 0.50. 


ONIHOLIMS dOOotT 


6E9¢ 
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It remains to show that the load-and-shift algorithm will yield all 
the feasible solutions to equations (7). 


Theorem 9: If X° = (x, a3, +++ , x°) isa feasible solution to equations (7), 
then 
N 
4; = 2 = Min = Dik} ) rsp (22) 
j=1 


Proof: The left-side inequality of (1) is part of equations (7). For the 
right-side inequality, suppose first that 2? < \; , then 


n n 
> ijt, = 1 and SS Wate <l. 
7=1 j=l 


Taking the difference we obtain 


2 (a5 — a;-1;)¢; = 0. (23) 
7=1 
From the definition of a;; we have that 
Qij— Oi1; = —p;:, forall 7 #1, 
and 
Qi —- G1; =1— Di. 
Therefore 


v; 2 > Disks : (24) 
j=l 
It follows then that 2? = i, if aa p;:t; = d, and therefore 


Bat a Min {Spal x}. 
j=l 


Theorem 10: Let X° be a solution to equations (7), and assume that \ > 0, 
then 


(i) If xf = 0, then x} = O for all i such that S; > 8; . 


7 


(it) If x} > O, then x! > O for all i such that S; > S; . 


7 


Proof: If S; — S; there exists a sequence p;,4, 5 Dis.ko) °° * » Dkr-aske 9 
Px,,i Whose product is positive. Suppose x? > 0, then min ()07,-; PnmXe 
An) 2 Diet; > 0. From relation (22) it follows that zx?, > 0 and simi- 
larly «2, > 0, +++ , 2? > 0. This proves 77. The proof of 7 is immediate. 


Remark: The assumption \ > 0 is not restrictive. If \; = 0, we apply 
a shift operation to the 7th column of P. 
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Theorem 11: Let B™ be a dominating submatrix of P and assume, 
without loss of generality that C = {S8,, S2, +++ , Sz} ds the class of 
dominating stations represented by B‘“. A and X™ are then uniquely 
defined. Suppose X° is a feasible solution to set of equations (7) then 


Either: (i) 22 = ax\,71 = 1, 2, Oe 0 ees ee 

or: (ii) a = AVA” ,¢ = 1, 2, +++ , k, where the inequality 
as strict for at a one value . Ui 

(iti) If i ts the case then x° = O for all S; ¢ C™ and leading 

to C”. If there exists such an S; and x! > 0, then ti is 


the case. 
Proof: Since B“ is a dominating submatrix then )>*_, p;; = 1 for 
hea eee 
(i) We assume that xz? < d; for? = 1, 2, --- , k and then sum the 


first k inequalities (24) to obtain 


n 


k k k 

pa ees > ws Dy Dit - 

i=) 1=1 g=k+1 i=1 
For this relation to hold it is necessary that x} >J'_, p;; = 0 for all 
j =k +1.1f C™ is not accessible from S; then >>*_, p;; = 0, other- 
wise either >>*_, p;; > O and therefore x? must equal zero, or S; > S,, , 
m2=k+41,and >>*_, p,; > 0 and therefore x2 must equal zero. From 
Theorem 10 we know that in this event x} = 0. Concluding then that 
2° = O for all {j: 8; ¢C } and 8; > C™ we obtain [in a manner similar 
to the one used in obtaining relation (24)] 


k 
0 . 
= > Pts 3 ea 1 2 seek, 
j=1 


yielding 


i BO. pe onl 09 


a 


Since 0 S$ x? < i, we have that 0 < a < A". It also follows that if 
there exists S; ¢ C‘” and leading to C‘” and x? > 0 then there exists 
at least one value of 7,7 = 1, 2, --- , k, for which x} = ),. 

(zz) Suppose there exists a value of 7,2 = 1, 2, -:- , k, such that 
a? = Ae, From Theorem 10 it follows that x? > 0 for alli = 
1, 2, --- , k. We select a number a, AX’ > a > 0, such that 


Min {a} — av;"} = 2, — ary = 0. 
1=1,2,°¢°,k 


Clearly if such an a exists then x?, < A‘ 2% and from relation (24) 
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we have 
Lin = DS Dinh} : (25) 
On the other hand we have 
ty, = arti? = Y piso”. (26) 


Subtracting equation (26) from (25) yields 


k n 
0= Dy Diin(S — ax;") + pp Dink . 
j=1 j=k+1 
Since S;,, 2 C‘” there exists a value of j, 7 ¥ 7, and S; e C“”, such that 
Diim > 0. It follows then that x? = ax? fori = 1, 2, --+ , m. This 
is a contradiction, which completes our proof. 


Corollary: Let B,, Bz, +++, Bry, be all the dominating submatrices 
of P = “YP, and let °C, , "Cz, «++ , Cn, be the corresponding domi- 
nating classes. If X° is a solution to equations (7) then eather 


x = > sare eee ys a se 1, 0 = ON a < say , (27) 
t=1 t=1 
or there exists at least one class of dominating stations, say “’C, , such 
that X° = (PA, X,. 


Theorem 12: If X° isa feasible solution to equations (7) it 1s obtainable 
by the load-and-shift algorithm. 


Proof: If X° is given by equation (27), it is obviously obtainable after 
one iteration of the algorithm. Otherwise there exists a class of dominat- 
ing stations, say ‘’C;, such that X° = “A;"X,;. We execute a 
shift operation involving the submatrix “’B; and obtain a stochastic 
matrix P® (note that P® # P). We let X°” = X° — ‘a,;X, 
and \” = x — a,"?X;. Clearly X° is a feasible solution to equa- 
tions (7) when using as parameters P™ and \® and replacing the 
right side by 1 — ‘A;. It is possible therefore to proceed with a 
sequence of load-and-shift operations until X° is obtained. Since the 
algorithm takes into account all possible sequences of load-and-shift 
operations, X° is contained in the set of solutions given by equation (21). 


Corollary (uniqueness): If P has exactly one dominating matrix then 
there exists a unique feasible solution to equations (7). Note, however, 
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that this 1s not a necessary condition for uniqueness. It is possible that P 
will have several dominating submatrices and the solution will be unique. 


V. DISCUSSION 


The heavy traffic assumption enables us to regard the system as a 
deterministic one. The analysis of the deterministic flows shows that 
stations tend to “band” into classes with the ability of dominating 
the system and preventing other stations from using it. 

This undesirable property, which can be eliminated by exercising 
appropriate control, also may affect the stochastic behavior of the 
system when heavy traffic conditions do not exist. One can imagine 
two classes of stations competing for domination of the system. Since 
traffic is not heavy, all the stations in the system are able to deliver 
their messages in finite time. Nevertheless, when one dominating class 
controls the system, it will prevent other stations from using the belt 
line until the queue at one of the stations belonging to this class becomes 
empty. At that moment the competing dominating class is able to 
take over and prevent other stations from using the belt line. This 
may result in a situation of alternating priorities (see Ref. 4) where, 
while one class is served, the queues at the competing class build up. 
While average queue sizes may not be strongly affected, the strong 
fluctuations in queue lengths may be undesirable. This possibility has 
been explored numerically by the use of a digital simulator.* The 
operating principles of the simulated system will be explained with — 
the aid of Fig. 2. 

Each of the stations is represented by a B-box. A packet coming 
out of a station is first multiplexed on the line by the B-box, provided 
the line is free, and then is passed from B-box to B-box until its destina- 
tion is reached. At each B-box on the way the address of the packet 
is examined. At the particular B-box of destination the packet is taken 
off the line. The main function of the A-box, shown in Fig. 2, is syn- 
chronization of the loop. 

Assume that the packets are made of L bits each and the address 
is given in the first & bits of the packet. A time unit in this system is 
the time it takes to multiplex a bit on the main line by a B-box. Assume 
also that the traveling time from one B-box to the adjacent one is 
zero. Station 2 is allowed to start sending a packet at times mL + ik, 
m = 0,1, 2, -+- , providing that the main line is free. The A-box is 
a buffer. Bits coming out of the B-box of station n accumulate in the 
A-box. Suppose at time & B-box 1 starts sending a packet. At time 


2544 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1971 


STATION 
3 





STATION 
1 


Fig. 2—A schematic description of a loop with n stations. 


k + 1 the first bit reaches B-box 2 and is delayed there until time 2k 
when the whole address of the packet has been received. If the packet 
is addressed to station 2, it will be taken out and at the same time 
B-box 2 can start sending out its own packet. If the packet is not ad- 
dressed to station 2, it will be sent from B-box 2 to B-box 3 where the 
same process will take place starting at time 3k. Bits arriving at the 
A-box are buffered. At time L the A-box starts to send bits (at the 
same rate as a B-box) until a whole packet has seen sent. If the buffer 
is empty the A-box will wait another L time units and will start sending 
at time 2Z. In general, the 7th B-box checks its buffer at times mL + 7k, 
m-= 0,1, 2, --- , and if the buffer is empty it may start sending its 
own packet. If the buffer contains the address of station 7, the B-box 
will remove the arriving packet and may, at the same time, send out 
its own packet. If the buffer contains an address different than station 
1, the B-box will pass on the arriving packet. 

In a similar manner the A-box checks its buffer at times mL, m = 
1, 2, --- ,. If the buffer is not empty, it sends out L bits. If the buffer 
is empty, the A-box remains inoperative for the next L time units. 

The loop time is the time it takes a bit to complete one round of the 
loop, and is measured in multiples of LZ. In the single A-box loop de- 
scribed here the loop time is the smallest integer greater than or equal 
to nk/L where n is the number of B-boxes in the loop. Clearly there 
is a complete analogy between the loop described here and the one 
presented in Fig. 1 if the number of revolving arms is taken as equal 
to the loop time. 
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It is important to note that the simulated system described here 
represents only one conceptual way in which the loop may be operated. 
It is possible, for example, that the packets will move from one B-box 
to another in one block rather than bit by bit. This is equivalent to 
placing an A-box between any two B-boxes (n rotating arms). 

We have used the digital simulator to examine the queuing char- 
acteristics of several small systems. The main purpose of the simula- 
tion was to study the effects of dominating classes in nonheavy (non- 
saturated) traffic situations. Numerical results are presented for an 
8-station loop with two dominating classes C,; = (S;, S2, S3, S,) 
and C, = (S;, Ss, S;, Ss). The loop time for this system was selected 
to equal 1 (one rotating arm) and the system was simulated for three 
different expected main line loads (utilization). The P matrix and 
average queue sizes (in packets) at the stations are shown below. 


Simulated Example: 8-Station Loop (Two Dominating Classes) 
Si S. Ss S, Ss; Se 8S, Ss 








S| 0 1/3 1/3 1/3 1 0 0 0 0 
S|1/3 0 1/3 1/3 1 0 0 0 0 
S3| 1/3 1/3 0 1/31 0 0 O 

pee 2 oO 
S| 0 O O 0 ! 0 1/3 1/3 1/3 
S| 0 0 0 0 1/3 0 1/3 1/3 
S| 0 0 O O 1 1/3 1/3 0 1/3 
SLO OO 0 011/38 1/3 1/3 0 

Line and 

Source Average Queue Sizes 


No.| Utilization | S: SS. S3; Ss Ss S¢6 Sy Ss | Ave. | Max. 


1 | \ = 0.239, | 9.7 10.1 18.8 6.7 15.6 14.6 11.2 6.8 { 11.1 | 33-58 
p = 0.956 

2 |X = 0.229, | 5.6 5.6 6.1 3.0 5.0 6.8 5.2 3.2 5.1 | 26-45 
p = 0.916 

3 |X = 0.213, | 2.4 2.6 1.9 1.1 2.4 2.8 2.1 1.2 2.0 | 16-22 
p = 0.852 





The alternating priorities effect, due to domination, is demonstrated 
in Fig. 3. The total number of packets at the four queues of C, (dotted 
line) and C, (solid line) were plotted against time. For the case p = 


SUM OF QUEUE SIZES AT C; = og 
—--- SUM OF QUEUE SIZES AT C2 = (Ss55g>S7> Sa) 


QUEVE SIZES 





[I 
0 50 100 150 200 250 300 350 400 450 500 550 600 650 700 750 
TIME 


Fig. 3—Simulated queue sizes. 


0.956 (high utilization) one can clearly see that only one dominating 
class is served at one time. When stations of C, take over control 
of the system the queues at stations of C, build up while at C, they 
are being depleted until the queue at Ss reaches zero and C, can take 
over. The average cycle time (time elapsing between two consecutive 
peaks of the dotted or solid lines) for this case was 110 time units. 
As the load on the system is decreased, the alternating priorities effect 
becomes less and less distinctive. For p = 0.916 (not graphed) the 
average cycle time reduces to 23 time units, and for p = 0.852 (see 
Fig. 3) alternations are very frequent and cycles are practically un- 
noticeable. 

Notwithstanding the complete symmetry within classes the average 
queue sizes at S, and Sx, are consistently smaller than the average 
queue sizes at the other stations. The explanation of this phenomenon 
is as follows: At a moment when C, loses control to C, the queue size at 
S, 1s zero while at S,, S., and S; it is greater than or equal to zero. 
From that moment on the queues of C, build up at equal average rates 
until the moment control returns to C, (peak of the dotted line in Fig. 3). 
At that moment the queues start being depleted at equal average 


t Note that the queue size at Ss being zero is a necessary but not sufficient condition 
for losing control. 
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rates. Since the expected queue size at S, is smallest, it has a higher 
probability of being completely depleted first. This phenomenon tends 
to shorten the alternations cycle. 

The alternations cycle will be shorter when the number of rotating 
arms (loop time) is increased. Therefore large loops (local or regional 
loops) will be relatively more stable when high utilization occurs. 

The nature of the stochastic process used for generating packets 
at the stations of the loop is described in Ref. 5. 

An important aspect, not analyzed in this study, is the question 
of the order of stations in the loop. Clearly, the amount of traffic the 
loop can carry and the resulting congestion are strongly dependent 
on the specific order of the stations in the loop. In Example 1 we have 
assumed counterclockwise traffic direction. If we reverse the direction 
of traffic on the main line we shall get a different solution for the flows. 
The two solutions are compared in the following: 


Counterclockwise Clockwise Flow 
Flow Direction Direction 
x, = 0.4646 x, = 0.5000 = d, 
Lg = 0.4000 = de Lg = 0.4000 = de 
x; = 0.5000 = d, zz; = 0.5000 = dz 
X4 = 0.23383 x, = 0.0917 
x; = 0.0508 x; = 0.3667 
Total: 1.6487 1.8584 


Reversing the flow direction results in an increase of 13 percent in 
the amount of satisfied demand. 

In a practical situation not all orders are feasible. Still the number 
of feasible orders may be overwhelmingly large and an appropriate 
algorithm for determining best order is called for. An interesting pos- 
sibility is a double loop system where each station is connected to two 
loops with opposite traffic directions. This may increase reliability 
and enable better utilization by allocating traffic in an efficient manner. 
One possible allocation rule is shortest distance allocation where the 
loop to be used is the one with the shortest travel distance for each 
particular message (this is an example of a possible rule and is not 
proposed as an optimal rule). 

The bounded linear complementarity problem presented by equa- 
tions (7) is of somewhat more general interest, bearing little relation 
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to the Pierce loop. In matrix form we have 
X+U =), 
AX+2Z =1, 
UZ = 0, U=0, Z20, X20. 


Substituting X = \ — U we obtain the same set of relations in a slightly 
different form. 


AU-—-Z=AdN-1=¢4q 
U"Z = 0, U>20, Z=20, Us)». (28) 


I 


It is now possible to compare our problem to the Fundamental Problemt 
treated by Lemke,° and Cottle and Dantzig.” 


AU-—-Z=4q 
U’Z = 0, U = 0, Z=0. (29) 


The only basic difference between the two problems is that in our 
problem U is bounded from above while in Lemke’s problem U is 
unconstrained. In this respect our problem is more general. The shift- 
and-load procedure is, however, fundamentally based on the specific 
structure of A and g and may not prove useful for a wider class of 
parameters. 
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APPENDIX 


In this appendix we outline a procedure for determining all dominat- 
ing submatrices of a given stochastic n X n matrix P. 


Step 1: Construct a matrix I = {2;;} where 
run if pis > 9, 
0 otherwise. 


t The term “Fundamental Problem” was coined by Cottle and Dantzig. 2 
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Step 2: Q = {qi;}: = II. 


Do for7 = 1,2, --: ,n. 
Repeat until q; ,k = 1, 2, °-- ,, remain unchanged. 
p= Get 2 Qki hoe 12, 
(Note that all additions are Boolean.) 
If S; — S, then in the resulting matrix Q the element q,;; = 1. 
Otherwise q;; = 0. 


Step 3: Construct a matrix Q® = {q?,} such that 


qi; = qe = = O:iQiz + 
: S; oe not belong to a communicating class then gi!) = 0, 
= 1, 2, -++ , n. Otherwise gf? = 1 and S; e C() if and only 


7 @ = = 1. C(2) is closed (dominating) if and only if the ith 
row of Q™ is identical to the ith row of Q. 
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Bending Losses of the Asymmetric 


Slab Waveguide 


By D. MARCUSE 
(Manuscript received April 30, 1971) 


The bending losses of the asymmetric slab waveguide are computed. 
The computation is based on the knowledge of the exact form of the solution 
of Maxwell’s equations of the bent structure and the additional assumption 
that the field near the bent waveguide can be approximated by the field of the 
straight waveguide. The result of this theory is in good agreement with an 
existing theory. It appears that the bending loss formula can be used to 
estimate the bending losses of the round optical fiber if the mode parameters 
entering the formula are replaced by the corresponding mode parameters 
of the round fiber. We present curves that allow the numerical evaluation 
of the bending loss of the lowést order even TE mode of the symmetric slab 
waveguide. 


I. INTRODUCTION 


E. A. J. Marcatili has shown that a bent slab waveguide loses power by 
radiation.’ His analysis is based on a solution of the eigenvalue equation 
of the bent waveguide. It is possible to derive the expression for the 
bending losses from an approximate theory that is much simpler than 
the solution of the eigenvalue equation. We use this method to derive the 
formula for the bending losses of an asymmetric slab waveguide. The 
symmetric slab waveguide is, of course, included in this treatment as a 
limiting case. The result of this approximate theory is in very good 
agreement with the theory of Marcatili. Furthermore, if the parameters 
of the HE,, mode of the round optical fiber are used in the slab wave- 
guide formula, loss values are obtained that agree well with experimental 
loss values for this mode.” 

The bending loss theory presented in this paper is based on the fol- 
lowing idea. A bent slab waveguide can conveniently be described in a 
cylindrical coordinate system whose axis coincides with the center of 
curvature of the waveguide (Fig. 1). The solution of Maxwell’s equations 


2551 


2552 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1971 






/ 
CENTER OF 
CURVATURE 


Fig. 1—Bent slab waveguide with a cylindrical coordinate system centered at the 
center of curvature. 


in the cylindrical coordinate system is known so that the shape of the 
field distribution of the curved waveguide is known except for an un- 
determined amplitude factor and for the value of the order number of 
the cylinder function of the solution. Both of these unknown parameters 
can be obtained if we assume that the field in the vicinity of the wave- 
guide must be similar to the field of the straight guide as long as the 
radius of curvature is large. The approximate solution is used to calculate 
the power that is radiated from the waveguide so that the power loss 
per unit length caused by the waveguide curvature can be determined. 
This procedure leads to a simple equation for the curvature loss. The 
theory breaks down when the curvature is so severe that the field near 
the waveguide can no longer be approximated by the field of the straight 
guide. The limits of applicability of the curvature loss theory can be 
expressed by inequalities for the waveguide parameters. 


II. THE FIELD OF THE STRAIGHT ASYMMETRIC SLAB WAVEGUIDE 


The field of the straight asymmetric slab waveguide is obtained as the 
solution of a straightforward boundary value problem. The geometry of 
the structure is shown in Fig. 2. We assume that there is no field variation 
in the y direction so that the waves of the structure are simple TE and 
TM modes. We limit our discussion to TE me The field is then given 
by the following equations:° 


E, = Ae?” d 


IIA 


r< a, (1a) 


E, esd, (1b) 


A cosx(z — d) — * sin x(a — d) —d 


IA 
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E, = A(cos 2xd + *sin Dade? —-«x <x —-d. (1c) 


The amplitude of the field can be expressed by the power P carried by 
the field: 


A = 2 | ne (2) 
i +t\, 2 2 

|o(2a + 2+ the +9) 

A factor exp [i(wt — 8z)] has been suppressed. The constants and param- 

eters appearing in equations (1) and (2) are defined as follows: 


w = 2rf, radian frequency, 
P = power carried by the mode, 
Ho = magnetic permeability of free space, 
€é. = electric permittivity of free space, 
8 = propagation constant, 
2d = slab thickness, 


« = (nk — Bp’), (3) 
y = (8 — nik’), (4) 
6 = (6° — nk’), (5) 
k = wlEouo)*, (6) 


nm, = refractive index in the region d < 2, 
MN, = refractive index in the region —d < x < d, 
nm; = refractive index in the region —» < x < — d. 


The magnetic field components are obtained from the equations 


i ak, 
H. = WH, Oz @) 
Z 
x 
N3 N2 n 
ed 


Mig. 2—Straight slab waveguide. m1, m2, and ns are the refractive indices of the three 
media. 
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and 
2 OE, 
Wi, OX 


z 


(8) 


The boundary conditions (requirement of continuity of the transverse 
electric and magnetic field components at the two interfaces) lead to the 
eigenvalue equation for the propagation constant 8, 


as fa a 
(1 — 1) 
‘en 


III. THE FIELD OF THE CURVED STRUCTURE 


tan 2xkd = (9) 


The solution of Maxwell’s equations in the region (R +d) <r< 
can be expressed as follows: 


E, = BH® (nkr)e7*”*, (10) 


The Hankel function of the second kind and of order v represents an 
exact solution of Maxwell’s equations in the coordinate system shown in 
Fig. 1. There is no field variation in the direction of the axis of the 
cylindrical polar coordinate system of Fig. 1. (In the coordinate system 
used in Fig. 2 the direction of the polar coordinate axis would be y.) The 
radial distance r is measured from the center of curvature of the bent 
waveguide. The Hankel function of the second kind is required since at 
infinite distance, r — ©, an outward traveling wave must result. With 
our time dependence, exp (twt), the field of equation (10) satisfies this 
requirement. 

The order number » need not be an integer in this case since we need 
not require periodicity of the field as a function of the polar angle ¢. If 
we were interested in an exact solution of the problem of the mode 
traveling along a curved waveguide we would obtain the value of v as the 
solution of an eigenvalue equation. In our approximate treatment we 
assume that the field near the waveguide can still be approximately 
described by the field of the straight structure. We can use the coordinate 
system of the straight guide (Fig. 2) to describe the curved guide. The z 
axis of the straight coordinate system becomes bent and we have the 
relation 


z= Ro. (11) 


The function exp (—7@) is equivalent to the propagation factor exp 
(—76z) of the straight waveguide so that we have the approximate 
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B= (12) 


Puls 


Since FR is much larger than the wavelength of the field, » is a very large 
number. Using v as defined by (12) in (10) constitutes the first approx- 
imation. It remains to obtain a relation between the amplitude factor B 
and the power that is carried by the guided mode. To achieve this we use 
an approximation for the Hankel function that is valid for very large 
order number » in the region 


vy > nkr. (13) 


The desired approximation can be found in Ref. 4. 


v(a— tanh a) 





A? kr) = -i ———— (14) 
vis 
i 9” tanh a 
with 
Vv 
cosh a = ae (15) 
We obtain the hyperbolic tangent of a by the relation 
r\? |? 
2, 47 Ei —_ (nak | | 
u = tanha = io = x . (16) 








cosh a 3 


Equation (12) was used to replace v by 8. Near the axis of the waveguide 
we can use the approximation r/R ~ 1 so that we obtain from (4), 
(12), and (16) 

ytanha = yR. (17) 
This approximation is adequate for the denominator of (14). The ex- 
pression in the exponent must be approximated more accurately. We 


use the z coordinate to describe the radial distance from the center of the 
waveguide core and write 


r 2 
This x coordinate corresponds directly to the x axis of the straight guide - 
as shown in Fig. 2. Using (4) and (16) we obtain 
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2 2 x)’ 
é (y — 2(mk) z) 4 E - (nb) a}, igs 
2 B ae y/ R 
We can express a by a well-known relation between the inverse hyper- 
bolic tangent function and the natural logarithm* 





. 1 

a = tanh u = 4)n (Lt), (20) 

The logarithm can be expanded in an infinite series 
a=utivtiwt wre, (21) 

From (19) we obtain approximately 
» _ (vy nuk é (xy 

= (~) — pf) 5 (2). 22 
i (x) of y/ R\6 (22) 


Substitution of (22) into (21) yields 


_¥ 1 (x) 2 (2) a 
a at 3\s T5\6 + 


— (ef ea(i ats GY +) + aA (23) 
y/ KB 8° s° B° 
The first series can again be expressed by the logarithmic function. The 


second part of (23) contains a simple geometric series. We thus obtain 
the approximation 





1+; 2 
Pee air __ 86 _ (ky 1 2, (24) 
1-%j WW y_vk 
B 2 
With the help of (4), (19), and (24) we can form the expression 
i+ ; ? 
a — tanha =a — aa | tae (25) 
B 


The approximations (12), (14), (17), and (25) allow us to express the H, 
component (10) in the following approximate form: 


| 
Y 
fie, +3 | 
2 


Pin |-—"| —1 br 
exp | . Pe 7 ¥. | 
Wi epee a NB) oe 06) 


é€é € 
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The functional dependence of (26) on the coordinate x coincides with 
(1a). 

For large values of # the field of the curved structure near the wave- 
guide is approximately equal to that of the straight guide. The factor 
exp (—718z) appearing in (26) was omitted from (la). Comparison of (la) 
and (26) allows us, with the help of (2), to determine the amplitude 
coefficient of the field, 





| 5 wou cyRP 
B = 2xe? 7 aaa a oe 
re + : + Le + 7’) 
1+% | 
, _|18 Bi 
= ae 1 si (27) 


Very far from the waveguide, r > R, the Hankel function can be 
expressed by its approximation for large argument’ so that (10) assumes 


the form 
EB = B a we aa (28) 


Equation (28) is very interesting. It shows that far from the waveguide 
the field is very different from the field of the straight structure. Whereas 
the field of the curved structure decays exponentially near the wave- 
guide it assumes the form of a radiation field far from the guide. This 
behavior of the exact field solution (10) of the curved structure explains 
why curved dielectric waveguides lose power by radiation. 





IV. THE BENDING LOSS FORMULA 


Since we know the field far from the waveguide it is now easy to 
calculate the power loss caused by the fact that energy is radiated away 
from the waveguide. The amplitude of the radiation field is independent 
of the z coordinate. The z dependent factor in (28) determines only the 
phase of the field. The power loss suffered by the field at a given position z 
can thus be calculated by the radial power flow at the same position z, 
even though the contribution to this radiation may have come from a 
point 2; with 2; < z, because each length element of the guide contributes 
an equal amount of radiation. An element of unit length on the axis of the 
waveguide is projected on an element of arc length 


ip 
L= R (29) 
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at a distance r — R from the waveguide. The power loss 2a per unit 
length of waveguide js thus 


r S, 
2a = RP’ (30) 
S, is the r component of the Poynting vector, 
S, = —4B,H¥ = Inia |B, P, (31) 


and P is the power carried by the mode. Using (27), (28), and (31) we 
obtain from (30) 





2x4, —-U 
ieee OO 3 (32) 
(ny — nk a( 2a + 5 + 1) 
with 
Lhd 
B B aR iy R. (33) 


U = Fs a 
ac a 


The approximation on the right-hand side of (83) holds for y/8 « 1. 
The relation x? + 7’ = (n2 — n3)k’ was used to simplify (32). The range 
of validity of the bending loss formula (82) cannot be given precisely. 
We have already encountered the inequality (13) that was necessary 
for the approximation (14) to hold. A similar inequality can be stated 
for the field expressed by Bessel and Neumann functions inside of the 
waveguide. In order to be able to express the field inside of the curved 
waveguide by approximate expressions that reduce to the sine and cosine 
functions appearing in (lb) in the limit of large radius of curvature, 
we must require 


pe Nekr (34) 


everywhere inside of the waveguide. We can express these conditions in 
the form 


b> nik(1 + g) (35) 
and 


B< nik( 1 is a). (36) 
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The validity of our theory becomes doubtful if one or both of these 
inequalities are violated. However, a comparison with Marcatili’s 
theory’ shows that our approximation is still quite good even in regions 
where (86) no longer holds. 

The simple expression (32) for the bending loss of an asymmetric 
slab waveguide can be used only if the values of «x,y, 0, and 8 are known. 
It is, of course, only necessary to determine one of these parameters 
from the eigenvalue equation (9) since they are all interconnected by the 
equations (3) through (6). 

It is useful to point out that the loss equation (32) seems to be ap- 
plicable to other types of waveguide than the one for which it was 
derived. I have compared experimental values” of bending losses of a 
round optical fiber with the loss predicted by (82). For such a comparison 
it is necessary to use the parameters x, y, etc., that apply to the wave- 
guide to which the formula is to be applied. In case of the round fiber the 
parameters of the HE,, mode were used to compute the bending loss 
from (32). The reason for this choice of parameters is the fact that the 
parameter y determines the decay behavior of the field outside of the 
waveguide. It is very important that the proper field decay is used, so 
that it is more logical to use the y value of the round fiber instead of the 
value computed from (9), if (82) is to be used to compute the bending 
loss of the round fiber. 

It is a curious fact that the loss formula (32) can also be obtained 
without use of the Hankel function appearing in (10) if we use a field of 
the form . 


C ‘ 
E, = War exp - if y(r) ar} (37) 
with ; 
yr) = E x oa nit | (38) 


The validity of this claim can easily be checked by performing the 
integration. The factor in front. of the exponential function is somewhat 
arbitrary. However, the exponential function itself admits of a physical 
interpretation. 

The straight waveguide has a field that, outside of its core, behaves 
according to exp (—yz). In the curved system z is naturally replaced 
by r. If we consider that the process of bending the waveguide is likely 
to lead also to a distortion of the phase fronts we can try to describe 
the separation of consecutive wavefronts by an r dependent wavelength 
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(A, is the wavelength of the straight guide), 
Mr) = 2, o (39) 
The propagation constant is related to the wavelength by the relation 


20 R 
B(r) = Ar) = B, YT 
By replacing the propagation constant in (4) with (40) we obtain (88). 
Since y is now no longer a constant it is natural to replace yr by f y dr 
and thus arrive at the form of the exponential function appearing in (87). 
By using (37) instead of (10) and proceeding exactly as shown in this 
paper we obtain (82) once more. It is interesting that we have thus 
obtained an approximation for the Hankel function that holds for the 
region where the order number is very nearly equal to the argument as 
well as for the region where the argument is much larger than the order 
number. Equation (38) shows clearly that y(r) changes from real to 
imaginary values as r increases.' 

One might hope that:a similar procedure would allow us to obtain 
approximate expressions for the bending loss of the round optical fiber. 
However, such attempts lead to equations that are not in agreement 
with experiment. On the other hand, the loss formula (82) agrees well 
with experiment” if the parameters of the round fiber are used. 

The loss formula (32) holds for all values of the refractive indices 71, 
M2, and n3 for which mode guidance is possible. Small index differences 
are not required for (82) to be valid. 


(40) 


V. NUMERICAL EXAMPLES 


Because of the large number of variables involved it is not possible to 
provide graphic displays for all possible applications. The loss formula 
(82) is sufficiently simple (except for the need of knowing the waveguide 
parameters x, y, etc.) so that loss values for cases of interest can easily be 
calculated. We provide curves that aid in computing the bending loss of 
the even TE mode of the symmetric slab waveguide. 

Figure 3 is a comparison of our theory with the results of Ref. 1. 
The ordinate is the function (2A)'aR while (8A)?2dn2/\ is plotted on 
the abscissa. The parameter A is defined as no — n; (we are using 
nN, = 3). The expression 36n,RA?/\ assumes the constant value 60 
for the curve of Fig. 3. The agreement with Marcatili’s theory’ is re- 


t A discussion of bending losses based on a similar argument is presented in Ref. 5. 
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Fig. 3—Comparison between this theory and the theory of Ref. 1. A = n, — m1, 
Ni = N3, 86nRA?/X = 60. 


markably good. The inequality (36) is violated for values of the abscissa 
that are larger than 1.7. This may explain the departure between the 
solid curve representing equation (32) and the dash-dotted curve 
representing Marcatili’s theory. The fact that the solid curve actually 
touches the asymptotic value shown as a dotted line in Fig. 3 (this 
value is assumed for infinite values of the abscissa, its location as shown 
in the figure has no meaning) is probably an accident since the solid 
line increases again for larger values of the abscissa outside of the range 
shown in the figure. 

We restrict ourselves to the case of the symmetric slab waveguide 
with n, = ngs. It is possible to express « and y (which equals @ in this 
special case) as functions of 


V = (ns — najkid. (41) 


The propagation constant 6 depends not only on V but also on the value 
of n,k so that it is not possible to express the bending loss only in terms 
of V. However, in the special case that n; and n, are very nearly equal we 
have 8 = nk so that we need not actually solve the eigenvalue equation 
to obtain the propagation constant. To aid in the evaluation of the 
bending loss formula we provide curves for 2a6d’e” and for d°8’U/R in 
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Fig. 4 as functions of V. The approximate expression given in (33) was 
used to express U. If 8 can be approximated by n,k these two curves 
enable us to calculate the bending loss without any difficulty. For 
symmetric slab waveguides with a large value of nz — n, we can calculate 
6 with the help of (4) from known values of y and n,k. A plot of yd as a 
function of V is provided in Fig. 5. The parameter y is interesting in 
itself since it determines the exponential decay of the guided mode out- 
side of the waveguide core. 


VI. CONCLUSIONS 


The bending loss of an asymmetric slab waveguide has been calculated 
using an approximation that is based on the assumption that the field 
near the bent guide is still almost identical to the field of the straight 
guide. The results of this approximate theory are in good agreement 
with the bending loss theory of Marcatili* in the range of applicability 
of our theory. It is hard to apply a similar analysis to the bent round 
fiber because the exact form of the solutions of Maxwell’s equations for 
the curved structure is not known. However, the bending loss formula 
obtained for the slab waveguide model yields good agreement with 











Fig. raat functions 2a6d?e" and d*62U/R are plotted versus V = (nz — 1 )?kd. 
(mn = 13. 
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Fig. 5—Plot of yd as a function of V. (m = n3.) 


experiment’ if the mode parameters of the fiber mode are used in the loss 
formula instead of the mode parameters of the slab waveguide. 

For the case of small index differences, curves that allow the deter- 
mination of the bending loss of the lowest order symmetric TE mode of 
the symmetric slab waveguide are provided. 
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Attenuation of Unwanted Cladding Modes 


By D. MARCUSE 
(Manuscript received April 5, 1971) 


This paper contains design criteria for the suppression of cladding 
modes in optical fibers by means of a lossy jacket. The calculations are 
actually based on the slab waveguide model. However, 2t 1s believed that the 
results are representative, atleast to order of magnitude, for the round 
optical fiber. It 1s found that the cladding modes are absorbed most effectively 
if the real part of the refractive index of the lossy jacket equals the refractive 
index of the cladding material. However, slight deviations from this optimum 
design are not critical. . 

The loss of the cladding modes depends on. many parameters so that 
general statements as to the order of magnitude of the expected loss are hard 
to make. However, a loss of 1 dB/m should be easily obtainable. 


I. INTRODUCTION 


Losses in dielectric optical waveguides have been discussed by several 
authors.''? In two earlier papers’ '* we explored the problem of crosstalk 
between two parallel optical waveguides. Two types of crosstalk were 
considered. The directional coupler mechanism couples two dielectric 
waveguides even if they have no imperfections of any kind. The coupling, 
in this case, is caused by the exponentially decaying field tail of one 
guide reaching to the region of the second guide. The second mechanism 
causing crosstalk involves light scattering from waveguide imperfections. 
The light that is scattered out of the guided mode of one guide can be 
scattered back into the guided mode of a neighboring waveguide. Both 
types of crosstalk can, in principle, be reduced by placing a lossy 
material between the two guides. However, it was concluded that 
isolation of the waveguides by means of a lossy surrounding medium is 
really not necessary to avoid crosstalk. The waveguides must be de- 
signed in such a way that their mode losses are low. The same mech- 
anism that influences crosstalk also influences the mode loss. For 
example, if two guides couple appreciably by means of the directional 
coupler mechanism their mode fields penetrate each other to a con- 
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siderable extent. Adding loss to the surrounding medium thus also 
causes added loss to the mode field. In fact, if the loss in the surrounding 
medium is needed to reduce crosstalk it simultaneously adds so much 
loss to the guided mode as to defeat its purpose. The cladding of the 
fiber must thus be made sufficiently thick to protect the guided mode 
from losses of its environment. But a sufficiently thick cladding also 
provides adequate protection against crosstalk. The scattering cross- 
talk problem, on the other hand, does not benefit from a lossy sur- 
rounding medium for different reasons. Even if the cladding is made 
sufficiently thick to protect the guided mode from losses of the sur- 
rounding medium it is still possible to obtain substantial scattering cross- 
_ talk. However, the same mechanism that produces scattering crosstalk 
also causes scattering loss to the guided modes. Except for the unlikely 
case of a systematic periodic distortion that persists throughout the 
entire length of both waveguides, the scattering mode loss is more 
critical than the scattering crosstalk so that it is necessary to build a 
guide with sufficient accuracy to hold scattering losses down. Once this is 
achieved scattering crosstalk is no longer a problem. 

It thus appears as though it were not necessary to surround the 
members of a bundle of waveguides with a lossy medium in order to 
prevent crosstalk provided that the waveguides are built with low 
scattering losses and with a sufficiently thick cladding. However, there 
1S one more reason why losses in the surrounding medium may be de- 
sirable. We have so far ignored the possibility that power which is 
scattered out of the core of the waveguide is trapped in the cladding. 
These cladding modes are undesirable for two reasons. They couple some 
of their power back into the core causing problems of delay distortion of 
the guided core modes. In addition, these modes may reach the end of 
the waveguide and there enter the detector giving further rise to un- 
wanted, delayed signals. It is thus important to suppress cladding modes 
by providing each waveguide with a lossy jacket. . 

We must keep in mind that the loss requirements for the cladding 
modes are more stringent than the loss requirements for the core modes. 
A loss factor that is prohibitively high for a core mode may be far too 
low for a cladding mode. The cladding mode can be harmful even if it is 
too lossy to propagate through the entire length of the waveguide by 
itself. Cladding modes are continuously excited and reconvert to core 
modes throughout the entire length of the waveguide so that their 
presence is undesirable not only at the end of the waveguide but through- 
out the entire length of the guide. It is thus desirable to ensure cladding 


SUPPRESSING CLADDING MODES 2567 


mode losses far higher than the losses that appear excessive if they 
should occur for core modes. _ 

A very effective way of suppressing cladding modes would be to 
provide the waveguides of the cable with a surrounding medium whose 
index is higher than that of the cladding. In this case the light can no 
longer be trapped in the cladding but would rapidly escape into the 
surrounding medium. However, if it were not attenuated there it would 
still be harmful since some of it can scatter back into guided modes and 
some of it may reach the detectors at the end of the guide giving rise to 
crosstalk and delay distortion. . 

The next best thing, therefore, is designing a surrounding medium or 
jacket with a refractive index whose real part nearly equals the index of 
the cladding but which is sufficiently lossy to attenuate power that tries 
to travel in it. This paper tries to provide the design criteria necessary to 
build effective lossy jackets for the suppression of cladding modes. 


II. CLADDING MODE LOSS FORMULA 


Our interest is, of course, directed toward the cladding modes of 
round optical fibers. In fact, the formula for core mode losses in round 
optical fibers provided in Ref. 3 can be used to calculate cladding mode 
losses in fibers with infinitely thick jacket by letting the refractive 
indices of cladding and surrounding medium coincide and by shrinking 
the cladding thickness to zero. However, a discussion of the cladding 
mode losses for round optical fibers with finite jacket thickness is 
complicated due to the complexity of the round fiber equation. As 
always, the slab waveguide is much easier to treat than the round 
optical fiber. Results obtained for the slab waveguide model are ap- 
plicable to the round fiber at least as order of magnitude estimates. 
Since order of magnitude estimates are all that we need in the present 
situation and since the formulas for the slab waveguide are so much 
easier to evaluate we shall base our discussion of cladding mode losses on 
the slab waveguide model. 

The slab waveguide model to be studied is schematically shown in 
Fig. 1. We ignore the core of the waveguide since it does not contribute 
appreciably to the propagation behavior of the cladding modes. The 
refractive index of the fiber core is so close to the refractive index of the 
cladding in most fibers of interest for communications purposes that we 
can ignore the core altogether. The cladding mode losses are caused by 
the fact that the reflection of the electromagnetic energy at the boundary 
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Fig. 1—Sketch of a dielectric waveguide (core not shown) with a lossy jacket. 


between cladding and jacket is not 100 percent efficient. It is thus the 
plane wave reflection coefficient at the cladding jacket interface that 
determines the cladding mode losses. The reflection coefficient is higher 
for waves whose electric field vector is polarized parallel to the dielectric 
interface (TE modes) compared to waves whose magnetic vector is 
parallel to the interface (TM modes). The modes with the higher 
reflection coefficient have lower loss. The TE modes have lower loss 
than the TM modes. It is thus sufficient to study TE modes since the 
TM mode losses must be higher. A design that succeeds in suppressing 
TE modes does suppress TM modes even more effectively. 

In order to obtain the mode losses for the structure shown in Fig. 1 we 
study the reflection coefficient of a plane wave incident on the structure 
shown in Fig. 2. The electric field vector is given by the equations: 


BE, = Ae? 4. Be te) a <0, (1) 
Bos Ceti ao pe er) Os2rS8A, (2) 
E, = Ge‘ +t) ASxs~, (3) 


The time dependent factor exp (iw) has been suppressed. The param- 
eters x, o, and p are defined by the equations: 


«= (nik? — 6), (4) 
c= (n3k° = 6°)', (5) 
p = (nik? — p)}, (6) 


with 
k = w(€otlo)? —_ 21/Xo . (7) 
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The magnetic field components are obtained from EF, with the help of 


4 a, 
Why OZ 


7 OL, 


so 
sf Wo OX 


) 24S 


(8) 


The boundary conditions at the two dielectric interfaces determine the 
relations between the amplitude coefficients: 


a(k — p) coscA + t(kp — o°) sin oA 














a(k + p) cos cA + t(kp + 0°) sin cA~’ (9) 
_ K(o + pe” 
a a(k + p) cos cA + i(kp + a’) sin gA A, (10) 
_ K(o os eo 
ee a(x + p) cos cA + t(kp + 0”) sin cA A, (11) 
- Qnoei?® 
oo sin oa 4 (12) 


a(k + p) cos cA + t(kp + A’) sin cA 


The field that is described by the equations (1) through (12) is the 
cladding mode field of the waveguide. The type of mode that actually 
results depends on the width 2D of the guide and on the angle at which 
the plane wave travels in the slab; that is, it depends on the ratio «/@. 
For the calculation of the mode losses we do not need to know this ratio 
very accurately. It is clear, however, that lossless modes satisfy the 





Fig. 2—This diagram shows the interference between an incident wave and the 
wave that has penetrated into the jacket. 
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requirement | A | = | B |. The transverse field distribution in the slab at 
—D <= «Ss Dis thus a standing wave. Since it is very hard to achieve 
a large loss coefficient a for the cladding modes we can assume a « 1/ 
and have 


[art «1. (13) 


In order to obtain the mode loss, we calculate the power flow along the 
waveguide axis, 


— 1 ° H * ‘se f f) e \ 
Poe Re ie I. HH dep = Retg— | [By Pace (14) 


Re indicates the real part of the quantity in brackets. The integral is 
extended only over the cladding region of the waveguide assuming that 
most of the power is contained in this region. For reasonably low loss 
this condition is satisfied. Using (1) and (18) we obtain 


P, w BP AP. (15) 
wy 


The amount of power at the cladding boundary that flows per unit area 
out of the waveguide is given by 








= lj * — K | A ie ( Sear B ‘) . 
S, = Re {3H,H*},-0 Sian 1 — z (16) 
The power loss coefficient 2a (a is the amplitude loss coefficient) is 
Spc oe (17) 


The factor 2 on the right-hand side of (17) takes account of the fact that 
equal amounts of power are lost at the cladding jacket interface on either 
side of the waveguide. Substitution of (15) and (16) into (17) results in 
the desired formula for the loss of the cladding modes: 


2a = — (i - | 4). (18) 
“28D A 
The coefficient B/A is obtained from (9). We consider only the case that 


the medium outside of the waveguide jacket has the lowest refractive 
index, n3 < | m2| and nz; < 7m. We thus have 


p= —1ly (19) 








with 
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y= (6 — n3k’y? (20) 

being a real quantity. We set 
7=uU-— WwW (21) 

with u and v both real and positive. In the limit of an infinitely thick 
jacket, A — ©, we obtain from (9) and (18) 
= 2ku 
~ BD? + + + Qu) 
It is interesting to consider two different alternatives. First we assume 
that 





2a (22) 


Re(n3k’) > B. (23) 


The refractive index of the jacket material is a complex number 


NW =n, — Je (24) 


a; is the amplitude loss coefficient of a plane wave traveling in the 
jacket. We assume 


ni, > (25) 
and obtain from (5) and (21) 
u = [200 + (oo + Anzk’a)*]? (26) 
and 
v = [300 + $(00 + 4n;kai)*} (27) 
with 
oo = (nk? — B’)}. (28) 


It is apparent from (22) that a vanishes for u = O and for u = «. This 
indicates that 2a must assume a maximum as a function of u. By differ- 
entiation of (22) we find that the maximum is located at 


Umax = (+ 0°), (29) 


The maximum value 2a,.x that corresponds to (29) is given by 


KC 
BD[(? + 0°)? + x] 


In order to express the condition for up. in terms of the refractive 





(30) 


2Qmax = 
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indices of the waveguide we square (29) and substitute (26) on the 
left-hand side and (27) on the right-hand side of the squared equation. 
We obtain the solution 


Oo = kK. (31) 


Comparison of (4) and (28) shows that the maximum cladding mode loss 
for-an infinitely thick jacket is obtained if 


N, =M™, (82) 


that is, for the case that the real part of the refractive index of the 
jacket is equal to the refractive index of the cladding. The requirement 
of an infinite jacket thickness means only that the effect of the boundary 
between the jacket and the outside medium must be negligible. As a 
practical matter this condition is satisfied if the product a;A is larger 
than about 10 dB. For an infinitely wide jacket and as long as the 
condition (23) holds we obtain maximum loss for vanishing », that is, in 
the case of a lossless jacket. Increasing the loss of the jacket decreases 
the loss of the cladding modes slightly as long as v remains smaller than x. 
For very high losses of the jacket material the cladding mode loss de- 
creases with increasing jacket loss. This consideration shows that the 
loss of the jacket material must not be made too high. 
In the opposite case 


Re(nzk’) < p° (33) 
we obtain | 
w= [—} 8 + 4(6' + 4nik'ai)*} (34) 
and 
v= [b 8? + 4(6* + Anzk'ai)* (35) 
with 
5 = (6° — n2k’y (36) 


For vanishing loss in the jacket we now obtain u = 0 so that the cladding 
modes propagate without loss in this case. The waves are totally in- 
ternally reflected at the interface between cladding and jacket. The 
condition for uns given by (29) can now not be satisfied. For numerical 
calculation we shall always assume that the cladding modes are so 
tightly guided that we can use 


B= nk. (37) 
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Far from cutoff the values for x are obtained from* 
cD = (t+ 5 (38) 


with integer values of v. Even integers of vy belong to even modes while 
odd integers belong to odd modes. Equations (22) and (30) show that the 
lowest order mode (v = 0) propagates with the lowest loss. Since we are 
interested in maximizing the losses of all the modes it is thus sufficient to 
consider the loss of the lowest order even TE mode and use 


v 

kD = 3° (39) 
For finite thickness of the jacket material and sufficiently low loss the 
cladding mode losses vary as a function of cladding thickness A. Figures 3 
and 4 show these loss fluctuations for the case kD = 50, n. = 1.6, 
n, = 1.65, and ns = 1. The jacket loss was assumed to be 2a;D = 0.1 
dB in case of Fig. 3 and 1 dB in case of Fig. 4. These loss variations as a 
function of jacket thickness complicate the design of an optimum 
jacket. Since we need a certain minimum loss for the effective suppression 
of all cladding modes we are interested in the minimum values of the 
loss. It is thus necessary to find the position of the loss minima of 
equation (18). Since it is difficult to find the minimum values math- 
ematically from the loss expression we use a physical argument based on 

ray optics for the case n, > 7). 

Figure 2 shows the geometry of the problem. Minimum loss is obtained 
if the light rays that are reflected from the cladding-jacket boundary 
add in phase to the light rays that penetrate into the jacket, are reflected 
at its outer boundary, and reenter the cladding. The condition for 
minimum cladding mode loss can thus be stated as follows 


(n-kn + $2) — (mké + 1) = 2vm. (40) 


The distance from point A’ to point D’ in Fig. 2 is €, the distance from 
point B’ to C’ and on to D’ is n, and the phase angles ¢, and @¢2 are the 
additional phase shifts that the wave suffers on reflection from the inter- 
face between cladding and jacket and between jacket and the outside 
medium. The number » must be an integer. Using simple geometry we 
obtain 





1 sin 6 (41) 


The angle of the incident ray is shown vastly exaggerated in Fig. 2. 
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2aD IN DECIBELS 


kD=50 2ajD=0.1dB 


n= 1.6 





Fig. 3—Fluctuation of cladding mode loss as a function of jacket thickness A. 
(D = cladding thickness; 2a; = power loss coefficient of the jacket material; k = 
propagation constant of free space; m, ,, and n3 are respectively the refractive 
indices of the cladding, the (real part) of the jacket, and the outside medium). 
kD = 50, 2a;D = 0.1 dB, m = 1.6, n- = 1.65, nz; = 1.0. 


For low-order cladding modes the angle between the incident ray and 
the interface is very nearly zero. We thus obtain from Snell’s law 


; ni |? 
sin 6 = f — a . (42) 


r 


Because the input ray intercepts the interface at grazing angles we can 
approximate & as follows 


= 2A cof'o = BA — 


[n? — n2}* 


The additional phase angle on reflection from the optically denser jacket 


(43) 
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is 
go, = 7. (44) 


The phase angle ¢2 is more complicated. From the Fresnel formulas for 
the transmission of a plane wave across a dielectric interface we obtain 


oe (ny — | 
¢2 = —2 arctan feat ey (45) 
Collecting all these equations we obtain from (40) the jacket width that 
minimizes the cladding mode loss: 


2 2\3 

ye + 2 + arctan i “ 

A —_ 2 (n; * ee mi)*) 
sca k(n — ni)? 








(46) 
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Fig. 4—Same as Fig. 3 except that 2a;D = 1 dB. 
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Numerical values of (46) agree with the position of the minima appearing 
in Figs. 3 and 4. 


III. NUMERICAL RESULTS 


The theory presented on the preceding pages contains the information 
for the design of lossy jackets for the purpose of suppressing cladding 
modes. Because of the large number of independent parameters entering 
equations (9) and (18) it is impossible to draw curves covering all 
possible cases of interest. We are thus limiting the discussion of numerical 
results to a few cases hoping to show the trend and the order of magni- 
tude of the cladding mode losses that may be obtainable. 

All curves are calculated for a cladding index n, = 1.6 and an index 
of n; = 1 for the surrounding medium. Most curves apply to the case 
kD = 50. Those curves are drawn as solid lines. Broken curves are 
examples for kD = 100 and dash-dotted curves apply to kD = 500. A 
comparison between these three kD values gives some indication of the 
dependence of the cladding mode losses on the kD parameter. 

The cladding mode loss 2eD for a guide length equal to the half 
width D of the cladding is shown in Fig. 5 for n,/n: > 1 as a function of 
the jacket loss 2a,;A. (D = cladding half width, A = jacket thickness, 
both losses are expressed in dB). Figure 5 was computed from (9), (18), 
and (46) for the minima of the fluctuating loss curves (compare Figs. 3 
and 4). It is important to remember that the first loss minimum occurs 
for Anin Of (46) for v = 0. The curves of Fig. 5 do not apply to values of 
the jacket thickness A that are much smaller than this minimum thick- 
ness. The loss curves shown in Figs. 3 and 4 drop off very rapidly for A 
values that are smaller than the value at the first minimum so that the 
mode losses obtained from Fig. 5 would be much higher than the actual 
losses if the figure were applied to A values that are smaller than the 
first minimum. For jackets that are thicker than the minimum value 
(46) with » = 0 the curves of Fig. 5 show the lowest possible cladding 
mode loss. The loss curves shown in Fig. 5 are correct for small values 
of 2a;D (note that we have just replaced A with D). As long as it stays 
below a certain critical value the curves are independent of 2a,D. For 
larger values the cladding mode losses decrease with increasing 2a,D. 
The critical value is different for each curve shown in Fig. 5. The depen- 
dence of the cladding mode losses on 2a,;D is shown in Fig. 6. This 
figure presents the values of the cladding mode losses for infinite jacket 
thickness A as a function of 2a;D. It is apparent from Fig. 5 that the 
cladding mode losses become independent of the jacket thickness for 
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Fig. 5—The minimum cladding mode loss as a function of the jacket loss 2a;A 
for different values of n,;/n; > 1. 


sufficiently large values of A. Figure 6 shows the mode losses of Fig. 5 
after they have leveled off (A > «). It can be seen that the cladding 
mode losses become less sensitive to the value of 2a,D for increasing 
values of the jacket-to-cladding index ratio n,/n,. However, the loss 
curves for n,/n; = 1.001 become dependent on the jacket loss coefficient 
a; already for 2a,D = 1 dB. It is important to keep this in mind when 
using Fig. 5. 

Comparison of the dash-dotted, broken, and solid lines in Figs. 5 and 6 
shows that the cladding mode loss depends on the kD parameter as 
(kD)~* for A— ©. It is very hard to see this relation from (18); however 
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Fig. 6—Cladding mode losses for infinitely thick jacket, A > ©, as functions 
of the jacket loss parameter 2a;D for n-/n > 1 


it can be seen from (22) and (26) for small values of v. It is interesting to 
observe that the maximum loss (30) depends on kD only as (kD)*. 

In order to obtain a feeling for the absolute loss values that can be 
achieved let us assume that the cladding half thickness is D = 10um. 
The loss factor 22D = 10°” dB thus corresponds to a cladding mode loss 
of 2a = 1 dB/m. Cladding mode losses well in excess of 1 dB/m are thus 
easily obtainable. In fact it appears possible to design jackets that 
provide cladding mode losses of 100 to 1000 dB/m. 

So far we have considered jackets with a refractive index whose real 
part is larger than the index of the cladding material. If the real part of 
the jacket index is smaller than the cladding index, n,/n,; < 1, the 
cladding mode losses are no longer even approximately independent of 
2a;D. The loss curves can thus not be drawn as functions of 2a;A. 
Figures 7, 8, and 9 are representations of the cladding mode losses for 
n,/n, < 1 as functions of A/D. The parameter 2a;D is used to label the 
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different curves in each figure. Comparison shows that for equal values 
of kD and 2a;D the loss curves are nearly identical for all three figures 
for low values of A/D. The leveling off of the curves for large values of 
A/D is, however, quite different for each value of n,/n1. The dependence 
of the cladding mode loss for n,/n1 < 1 as a function of 2a,;D for A— « 
is shown in Fig. 10. The loss curves of Fig. 10 show maxima that shift 
as a function of n,/n1. The mode loss 2aD is not an oscillating function of 
the jacket thickness if n,/n, < 1. The curves shown in Figs. 7 through 9 
are thus not the loss minima but the actual loss values as functions of 
A/D. 

The dependence on the kD parameter for n,/n, < 1 can be seen from 
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Fig. 7—Cladding mode losses as functions of the jacket thickness for n/m = 0.999. 
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Fig. 8—Cladding mode losses as functions of the jacket thickness forn,/m = 0.99. 


the figures to be as (kD)~* for A — while the dependence for small 
values of A/D is as (kD)~”. 

Even for jackets with a refractive index whose real part is smaller 
than the cladding index, losses well in excess of 1 dB/m are obtainable. 
However a comparison of the two cases, n,/n, > 1 and n,/ny < 1, 
shows that jackets with a refractive index higher than the cladding 
index provide more losses to the cladding modes. 

Lossy jackets need not be very thick to be effective. For example, for 
kD = 50 and A/D = 0.1 (for D = 10um this would correspond to a 
jacket lum thick), we could choose n,/n; = 1.1 and obtain a cladding - 
mode loss of 2aD = 2 X 10°° dB (or 200 dB/m) with a total jacket loss 
of 2a,;A = 1 dB or 2a;D = 10 dB. Figure 6 shows that the curve with 
n,/n, = 1.1 1s still constant at 2a,;D = 10 dB so that Fig. 5 is applicable 
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for a jacket loss of that magnitude. Increasing the jacket thickness to 
much larger values would triple the cladding mode losses (in dB) in this 
case. An increase of kD to kD = 500 reduces the loss to 7.2 X 10°° dB 
(or 7.2 dB/m). 


IV. CONCLUSIONS 


Losses for cladding modes have been calculated with the help of a 
slab waveguide model with a lossy jacket. It has been found that the 
cladding mode losses are maximized if the real part of the refractive 
index of the jacket material equals the refractive index of the cladding 
material. However, high cladding mode losses can be achieved with 
jackets whose real part of the refractive index is either lower or higher 
than that of the cladding material. Losses in excess of 1 dB/m are easily 
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Fig. 9—Cladding mode losses as functions of the jacket thickness for n-/m = 0.95. 
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Fig. 10—Cladding mode losses for infinitely thick jacket as functions of the jacket 
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loss parameter 2a,;D for n-/m < 1. 


achieved. A careful design should make it possible to obtain cladding 
mode losses between 100 and even up to 1000 dB/m, depending on the 


kD value at which the fiber is operated. 
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Improved Intersymbol Interference 
Error Bounds in Digital Systems 


By Y. S. YEH and E. Y. HO 
(Manuscript received April 21, 1971) 


A thorough solution to the problem of determining the error rate of a 
digital communication system with intersymbol interference and additive 
Gaussian noise is presented in this paper. The solution achieves for the 
first time a combination of computational simplicity and a high degree 
of accuracy, and is obtained by deriving tight upper and lower bounds 
on the error rate. It 1s shown that, for a system with a normalized peak 
distortion less than unity, these bounds can be made to differ by an arbi- 
trarily small amount. The numerical evaluation of the bounds takes less 
than one second on the GE-Mark II time-sharing system for almost all 
the cases. 

Examples are gwen for 2M-ary digital systems to demonstrate the 
accuracy and computational efficiency of our method. The results show 
that our estimates of error rate are generally orders of magnitude better 
than the Chernoff bound. For example, in the case of an ideal bandlimited 
system [(sin t)/t pulse shape] with a signal-to-noise ratio of 16 dB and a 
sampling instant deviation of 0.05 from the optimum value, the lower 
and upper bounds on the error rate are 1.1 X 10°° and 1.2 X 10™*, respec- 
tively. 

This method can also be applied to the calculation of the performance 
of certain phase-shift-keyed systems and certain systems with co-channel 
interference. 


I. INTRODUCTION 


In many cases the transmission efficiency of a digital system is 
largely limited by intersymbol interference rather than by additive 
noise. Intersymbol interference may result from imperfect design of 
the filters, distortion in the transmission channel, nonideal sampling 
instant, or nonideal demodulating carrier phase. In analyzing such a 
digital data system, it is important to determine the system error rate 
due to intersymbol interference and additive noise. 
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Various methods’~’ to evaluate the error rate have been proposed. 
They provide either a loose upper bound of the error rate or the error 
rate of a channel with truncated impulse response. 

In this paper we present a simple method to evaluate both an upper 
and a lower bound of the error rate without invoking the finite pulse- 
train approximation. Furthermore, it is shown that for a system with 
a normalized peak distortion less than unity, the upper and lower 
bounds can be made arbitrarily close thus obtaining an accurate estimate 
of the error rate of the system. This method can be applied to 2//-ary 
AM and coherent phase-shift-keyed systems. 

The data system model will be described briefly in Section II. Various 
proposed techniques to evaluate the error probability and their draw- 
backs are discussed in Section III. In Section IV, we will present new 
upper and lower bounds and the computation of the bounds by a series 
expansion. Applications and the convergence properties of the bounds 
are described in Section V. Throughout, additive Gaussian noise and 
independence of information digits are assumed. 


II. BRIEF DESCRIPTION OF THE SYSTEM 


A simplified block diagram of a digital AM data system is shown in 
Fig. 1. We assume that an impulse 6(¢) having amplitude a, is trans- 
mitted through the channel every T seconds. The system transfer 
function is 


Ri) = S(w)T(w)EH(). (1) 


In the absence of channel noise, a sequence of input signals, 


co 


d. a dt — IT), (2) 


l=-0 


will generate a corresponding output sequence, 


foe) 


2» art — If), (3) 
where r(t) is the Fourier transform of R(w), {a,} is a. sequence of in- 
dependent random variables, and a, = +1, +3, --- + (QM — 1) 
with equal probability for all integers, 1. We also assume that additive 
Gaussian noise is present in the system. Thus the corrupted received 
sequence at the input to the receiver detector is 


yi) = Yo are = 1) +n, (4) 


=-0 
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Fig. 1—Simplified block diagram of a 2M/-ary data system. 


where n(t) is additive Gaussian noise with power o watts. At the 
detector, y(t) is sampled every 7’ seconds to determine the amplitude 
of the transmitted signal. At sampling time ¢) , the sampled signal is 
y(t) = aor(tr) + >> ayr(to — IT) + n(t). (5) 

l=-0 

140 


The first term is the desired signal while the second and the third 
terms represent the intersymbol interference and the Gaussian noise 
respectively. 

The set of slicing levels is’ 


0, +2r (to), t4r(to), Prey + (2m — 2)r (to). (6) 


Based on the decision levels given by equation (6), for a particular 
transmitted signal level, a) , the conditional error probability is 


Ply(to) 2 —2(m — I)r(to)}, A = —(2m — 1) 
Ply(to) S$ 2(m — Yr(th)}, a = 2m— 1 (7) 
Pily(to) 2 (ao + Ur(to) Uy) S (ao — Ir(to)]}, 

ao ~ &(2m — I), 


where AUB is the union of the events A and B. 
Substituting equation (5) into (7), we obtain 


P,(@/do) = 


P{ 2D) aur(ts — IT) + (to) 2 r(to)}, ao = —Qm — 1) 
Pi > ar(t, — IT) + n(to) S —r(to)}, @ = 2m — 1 


P,(¢/do) = ae (8) 
PilD: ar(to — IT) + (toe) 2 r(to) JU 


a air(t: — IT) + n(b) S —r(to)]}, a % +m — 1). 
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Since So. 40 air(t) — LT) and n(ts) are equally likely to be positive or 
negative, equation (8) reduces to 


(P{ SS art) — IT) + nto) = r(to)}, do = (2m — 1) 
P,(e/d) = ee (9) 
2P{ > ar(to — IT) + n(to) = rlt)}, ao ~ &(2m — 1). 


140 


The error rate of the system is 


Pos SS P,(¢/do)P,,(ao) 


all ao 


[2m — 1)/m]P{ > a(t. — IT) + n(t.) = r(t)}. (10) 


We notice that in equation (10) the variables m, a, , and n(é).) have 
already been defined. The sequence r(é) — lJ’) is assumed to be known* 
in the following sense: 


r(t — IT) is finite and known Vle Sy, (11) 
where Sy is a set of N + 1 distinct integers (including 1 = 0) andt 
2 r(t) — IT) = 65 < &, (12) 

Define 
X= Di ar(t — If). (18) 


140 


From equation (12) we conclude that the infinite sum X converges 
absolutely to a random variable and equation (10) can be alternately 
written as 


0 


P.[  @roy* [exp {ly ~ r(t) + XP/20*} dy aF(X). (14) 


all X 


III. REVIEW OF EXISTING METHODS 


The existing methods of evaluating equation (10) can be divided 
into the following categories. 


3.1 Worst Case Estimate 


A worst case sequence’ or “eye pattern” analysis is frequently used 
to analyze a data system. The error probability is estimated by setting 


* The sequence r(é) — /7') is either experimentally determined or calculated 
through the system transfer function. 

t¢,? is obtained through the application of Parseval’s theorem to equivalent 
Nyquist pulse (p. 47, Ref. 1). 
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Yoiw0 Ar(to — IT) to its worst case value in equation (10). In many 
cases, this estimate is exceedingly pessimistic since the occurrence of 
such a worst case sequence is extremely rare. 


3.2. Chernoff Bound 


Recently, Saltzberg? and Lugannani® applied the Chebyshev in- 
equality to equation (10) to obtain the upper bound on error prob- 
ability. We have shown in Ref. 6 that these upper bounds are in many 
cases still too pessimistic by orders of magnitude. 


3.3. Finite Truncated Pulse Train A pproximation*”” 


When r(t) decreases rapidly relative to the sampling period 7, we 
may approximate the channel by a finitely truncated pulse train. 
The error rate can be calculated by enumerating all the possible com- 
binations of intersymbol interference. However, since each calculation 
of the conditional error probability takes a great deal of computer 
time, the number of m” must be held to several thousand.’ This limita- 
tion leads to a poor approximation of the true channel, and the error 
probability so obtained is not very useful. Recently, Hill® has reported 
that by computer simulation of the density function of Xy , the com- 
putation time can be reduced. 


3.4 Series Expansion Method 


Recently, Ho and Yeh’ and, independently, Shimbo and Celebiler’ 
discovered that equation (10) can be calculated in terms of an ab- 
solutely convergent series involving moments of the intersymbol 
interference.* Furthermore, the moments can be obtained readily 
through recurrence relations, and the computation time is greatly 
reduced. A better approximation of the real channel can be obtained 
by increasing the number of terms in the pulse train approximation. 
However, the error in the P, estimate introduced by the truncation 
of the system impulse response is still unknown. 


IV. ERROR BOUNDS AND COMPUTATION TECHNIQUES 


In this section we shall derive new upper and lower bounds on the 
error rates and define the range of applicability of our method. No 
truncation of the intersymbol interference is required. Furthermore, 
this method will give an accurate estimate of the error rate with a 
negligible amount of computation time. 


* Only truncated pulse train approximations are considered. 
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4.1 Upper and Lower Bound of P. 


Let the intersymbol interference be partitioned into two disjoint 
sets where 


Xv = > ar(t — 1D), (15a) 
ay 
and 
Xp = > ayr(to — IT). (15b) 
lgSnv 


Equation (14) can be rewritten as 
P= (@m—/m fof no’ 
all Xn Yall XR 


0 
Ma exp {—[y — r(t) + Xy + Xpl?/20°} dy dF(Xx) dF(X;p). 
(16) 
Proposition 1: P, is lower bounded by 
P, = [Qm—1)/m] [roy 
all XwN 


0 


[exp ([—{y = rlto) + Xx}?/204] dy dF(Xy), (17) 
provided the truncated system has an “‘open eye pattern,” i.e., 


r(t) — >» | r(to — IT) | = 0. (18) 


Proof: The complementary error function is concave upwards for 
negative values of its argument and satisfies the following relationship: 


2 erfe (ge + a) + § erfe (¢ — a) 2 erfe (2), 2-5 0. (19) 


Since Xp is symmetrically distributed around zero and Xy satisfies 
equation (18), we obtain, by applying equation (19), that 


| nx [exp (ly = rlto) + Xv + Xel'/20"} dy aX) 


=| _exp {—[y — rt) + Xw1"/20°} dy. (20) 


Substituting equation (20) into equation (16), we obtain the lower 
bound of equation (17). 
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Proposition 2: P, is upper bounded by 
P, = [@m—1)/m] f — no’y? 
all Xn 


[exp {=ly — rts) + XwP/203} dy aP(Xs), 
where ' 
ap = o (1 — og/0°)", 
op = (1/3)(2m — 1)(2m + 1)o; , 
and o? is defined in equation (12). 
Proof: Applying the following inequality, 
exp {—X;/20°} <1, 


to equation (16), we obtain 


ee = Heme 7 Die = iz = (2n0*)"* 
-exp {—[y — r(to) + Xy]’/20°} 


-exp {—[y — r(to) + Xy|Xp/o°} dF(X,) dy dF(Xy). 


Knowing from equation (15b), 


Xp => >: arr (to = IT), 


lgSn 


the average over X, can be performed, we thus have 
[exp (ly — rte) + XvlXn/0"} dP %s) 


= [I (exp {—[y — r(to) + Xwlarr(t. — IT)/0})e, , 


lgSn 
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(21a) 


(21b) 
(21c) 


(22) 


(23) 


(24a) 


where (g(x)), means expectation of g(x). It has been shown* that the 


following inequality holds. 


(exp {a.2})a: S exp (0*e?,/2) = exp {2*(2m — 1)(Qm + 1)/6}. (24b) 


Substituting equation (24b) into (24a) we obtain 
[exp (ly — rts) + XvlXu/o*} dX) 


= exp {[y — r(to) + Xy}or/20°}, 
* Appendix of Ref. 6. 


(24¢) 
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where o? is given by equation (21c). Substituting equation (24c) into 
(23) we obtain the upper bound of equation (21a). 

It is interesting to note that the upper bound differs from the lower 
bound only through a modification of the noise power by the truncated 
terms. For a system with a peak distortion* less than unity, by taking 
the set Sy large enough of approaches zero, o7 approaches o’, and the 
upper bound converges to the lower bound. Therefore, the exact error 
probability can be located within a small range. The computation 
time involved for large enough N is rather minimal when a digital 
computer is used as will be illustrated in Section V. 

4.2 Evaluation of P, and P,, 


We have already shown in Ref. 6 that equations (17) and (21) can 
be expanded into an absolutely convergent series involving moments 
of the truncated intersymbol interference. 

The series expansion of equation (17) is 


Py, = [2m — 1)/m] exte [=r(t)/(2o)] 
+ [(2m = 1)/m) > [2H 2e* ee)! exp (—1°(l)/20") 


‘Hoy-1((to)/(2?o)) Max, , — (25) 
where 


H.,-; 1s the Hermite polynomial, 
M., is the 2kth moment of the random variable Xx . 


The series expansion of equation (21) is similar to equation (17), 


Py = (o,/a){ (2m — 1)/2m] erfe [—r(t.)/(2?o1)] 


1 


+ [2m — 1)/m] D> (21) exp [=1°(6)/20 


Has(0()/Qes) Maa} (26) 
The moments (/,) can be obtained through the characteristic 
* Normalized peak distortion (D,) is defined as 


Dy = |r(to + nT)|/Ir(to)l. 


n40 
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function of X, without the explicit evaluation of the distribution 
function. The recurrence formula for M7», is 





Mn = —> e = 1 Mana 1)*{2*[(2m)* — 1]/24} | Bas 


i=l 24 —_ 1 
‘| > r(to — IT)"'], (27) 
leSn 


where B,,; are the Bernoulli numbers. 


4.3 Truncation Error Bound of Series Expansion 


The error incurred by truncating the series of equation (25) at 
(n — 1) term is given by 
R, = [(2m — 1)/m] 2) (2k!)"(20°) "(a)"? exp [r(ts)/20°] 
k 


=n 


-Ho,-1[r(to)/(2'o)]* Ma, . (28) 
Let 
\ = max | Xy | = (Qm — 1) Xe | r(to — IP) |. (29) 
140 
It can be shown that the moments satisfy 
Mors2p S Ma,d”, p =0,1,2,°--. (30) 
For (2k — 1) > 2, the Hermite polynomials are upper bounded by 
| Hou-1(a) | S$ 2° 3[(2k — 3)!9] 2k — 1 exp [x”/2]. (31) 


Substituting equations (80) and (31) into equation (28) we obtain the 
following: 


|B, | S (2m — 1)/m]2n)™* exp [—r°(to)/40°] 


co 


"Mon: (20°) ” DU (He!) (2h — 1)*0"/207)™ 


k=n 


i 


< [(2m — 1)/m](2r)* exp [—r*(te)/40°] 
May 22°) 5 (k1)7(2k — 1)72(02/20)*™ 


+ (PI) 2p — 1)4Q"/20°P “(1 — °/ 2pety » (82) 
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Fig. 2—The convergence of series expansion method; (sin zt/7')/(at/T) pulse, 
binary AM system, tp) = 0.05 7, SNR = 16 dB. The set Sy includes the first 12 
sampling points around ép. 


where p is an integer which is chosen to satisfy (\?/2p0") < 1. Similar 
truncation error bounds can be obtained for P,, . 


V. APPLICATION 


The error probability of a 2M-ary digital AM system with an ideal 
band-limiting pulse signal operating over an ideal channel is calculated 
by equations (25) and (26) to determine the convergence of the method. 
The received binary pulse is assumed to be 


r(t) = (sin wt/T)/(at/T). (33) 
The system SNR is defined by 
SNR = (a3)r7(0)/o’. (384) 


The convergence of the series expansion method is illustrated in Fig. 2. 
The system is binary with the sampling instant deviated by 0.057 
from its nominal sampling instant. The SNR is 16 dB. The set Sy 
includes 12 elements, i.e., 2 = +1, +2, --- , +6. It is observed that 
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Fig. 3—Convergence of P; as a function of number of terms in the series expansion; 
(sin rt/T')/(at/T) pulse, 4-level digital AM system, to = 0.05 7, SNR = 23 dB. 
The set Sy includes the first 12 sampling points around fo. 


the series converges after 3 or 4 terms. A similar example is given in 
Fig. 3 for a 4-level system with SNR = 23 dB. 

The convergence of the upper bound to the lower bound with in- 
creased size of Sy is illustrated in Figs. 4, 5, and 6 for binary and 4-level 
systems respectively.* It is observed that the two bounds indeed 
merge together as N is increased. The upper and lower bounds on the 
error rate were calculated using a program written for the GE-Mark IT 
time-sharing system. For the examples given here, computation time 
was less than a second. The change of N from 6 to 30 hardly had any 
effect on the computation time which indicates that one should start 
with Sy sufficiently large such that of is small in comparison with o’, 
probably of the order of 0.2 o° or smaller. Under this condition the 
upper and lower bound should be fairly close. As a comparison, the 
Chernoff bounds are also presented in Tigs. 4, 5, and 6. 

The method given here can also be applied to the calculation of the 
error rate of a coherent phase-shift-keyed system. The error rate 
calculation of a two and four phase system can be reduced to the basic 


* The number of terms in the series expansion used to calculate the points in these 
figures are determined so that the truncation error is negligible. 
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Fig. 4—Convergence of P,, to P;,; ideal band-limited pulse, binary system, SNR = 
16 dB, to = 0.05 T. 


formula, equation (10), which then requires the determination of 


P{ > ar(ts — IT) + (to) = r(to)}. (35) 


The method described here can then be applied. Similar applications 
can also be found in the error calculation of co-channel interference. 
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Fig. 5—Convergence of P. to P,; ideal band-limited pulse, binary system, SNR. = 
16 dB, to = 0.1 T. 
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Fig. 6.—Convergence of P, to P,; ideal band-limited pulse, 4-level system, SNR = 
16 dB, to = 0.05 7. 


VI. CONCLUSIONS 


We have presented a method to calculate the error rate of a coherent 
digital system subject to intersymbol interference and additive Gaussian 
noise. The error rate for a system with a peak distortion less than unity 
can be determined to arbitrary accuracy through the calculation of 
an upper bound and a lower bound of the error rate. The computation 
time involved (less than one second on the GE-Mark II time-sharing 
system) is many orders of magnitude shorter than the time required 
by the straightforward calculation of all the possible states. On the 
other hand, the results are generally much more accurate than the 
results obtained through the application of the Chernoff bound. 
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Dual Frequency Measurements of Rain- 
Induced Microwave Attenuation ona 


2.6-Kilometer Propagation Path 


By R. A. SEMPLAK 
(Manuscript received April 9, 1971) 


The ratio of attenuations measured simultaneously at 18.5 and 30.9 
GHz on a common 2.6-km path during 1970 in New Jersey show that, with 
exceptions for special cases, the Laws and Parsons drop-size distribution 
adequately represents the rain on terrestrial microwave communications 
paths. The ratio derived from cumulative distributions of attenuation is 
also discussed. A rain-induced attenuation of 30 dB is found to be exceeded 
0.001 and 0.01 percent of the time at 18.5 and 30.9 GHz, respectively. 
These results are in good agreement with attenuations derived from rain 
rates obtained on the Holmdel rain gauge network. 


I. INTRODUCTION 


Rain-induced attenuation at centimeter and millimeter wavelengths 
is a function of drop-size; conversely, a measure of drop-size can be 
obtained from multi-wavelength measurements over a common propa- 
gation path. As shown in Fig. 1, a 2.6-kilometer path, shared by both 
18.5 and 30.9 GHz, extends to the southeast of Crawford Hill, Holmdel, 
New Jersey. Both of the receivers are located in the same building at 
Crawford Hill; the equipment has been reported elsewhere,!? it will 
not be discussed here. 

In addition to obtaining a measure of the drop-size distributions by 
examining the ratio of the 30.9-GHz attenuation to the 18.5-GHz 
attenuation during rainstorms, sufficient data are available for a 
statistical examination of the rain-induced attenuations for both 
frequencies. In the following, the cumulative distributions of attenua- 
tion at 18.5 and 30.9 GHz will be discussed individually and then 
related one to the other. 
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Fig. 1—Map showing the propagation path of 2.6 km used at 30.9 and 18.5 GHz. 


II. VARIABILITY IN DROP-SIZE DISTRIBUTION 


The variability of drop-size can be assessed by examining the data 
from the common propagation path, shown in Fig. 2. In this figure, 
the ratios of 30.9-GHz to 18.5-GHz attenuation are plotted versus the 
total path attenuation at 18.5 GHz (the upper plot) and versus the 
total path attenuation at 30.9 GHz (the lower plot). In both plots, 
the dashed line is the theoretical ratio of 30.9/18.5-GHz attenuation 
obtained using a Laws and Parsons drop-size distribution,?* assuming 
a uniform rainfall over the path. It is apparent from the data in both 
plots of Fig. 2 that the theoretical ratios derived from Laws and 
Parsons represent the body of the data rather well, especially for path 
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attenuations exceeding 10 dB at 30.9 GHz; in fact the least squares 
fit to the data is not shown because it lies so close to the theoretical 
curve. However, for attenuations less than about 10 dB at 30.9 GHz 
there is a large scatter in the measured ratios; much of the scatter 
is caused by two particular storms. 

The two specific sets of data are shown in the two plots of Fig. 2 


@@ FEB.1970 
++ JUN.12, 1970 


30.9 GHZ 
18.5 GHZ 





0 5 10 15 20 25 30 35 40. 45 
dB -18.5- GHz (TOTAL PATH) 


RATIO OF ATTENUATION COEFFICIENT ( 


AN . 
“LAWS AND PARSONS 





0 5 10 15 20 25 30 35 40 45 
dB-30.9-GHZ (TOTAL PATH) 


Fig. 2—The ratio of attenuation at 30.9 GHz to that at 18.5 GHz plotted, in 
the upper graph, versus the 18.5-GHz total path attenuation and, in the lower, 
versus the 30.9-GHz total path attenuation. The dashed curve in each plot is the 
theoretical ratio using Laws and Parsons drop-size distribution. 
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by distinctive symbols; these data were measured during periods of 
the month of February, and on the twelfth day of June, 1970. Suffice 
it to say that these are special cases where the large ratios obtained 
in February are due to light rains composed of more small drop diam- 
eters than accounted for by Laws and Parsons, whereas the lower ratios 
obtained in June are the results of drops larger than that given by 
the Laws and Parsons drop-size distribution. If these two special 
cases are removed, the remaining data are well represented by the line 
labeled Laws and Parsons. The spread in the measured data is due 
not only to variations in drop-size distributions but also to a lack of 
homogeneity in the rainfall intensity along the propagation path. 


III. CUMULATIVE DISTRIBUTIONS OF 30.9- AND 18.5-GHZ RAIN-INDUCED 
ATTENUATION 


A sample of 6432 hours of recording on the dual-frequency experi- 
ment was obtained during the period January 1, 1970 through Septem- 
ber 25, 1970. During this period a total of thirty-six individual showers 
occurred. Of these thirty-six, attenuation measurements were obtained 
at both 18.5 and 30.9 GHz for thirty-four. The remaining two storms 
were missed; in one instance, there was an equipment failure at 18.5 
GHz, in the other, a failure at 30.9 GHz. 

Percent-of-time distributions for 18.5- and 30.9-GHz rain-induced 
attenuations are shown in Fig. 3 (by the solid curves) for the record- 
ing period of 6482 hours, not a full year. However, based on earlier 
work,? this period is known to include the months of heaviest rain 
rate, therefore the sample contains essentially all of the high attenua- 
tions which occur in a full year. 

Examination of the solid curves of Fig. 3 shows that for this 2.6- 
kilometer path and a 0.01-percent probability level, the 18.5- and 
30.9-GHz rain-induced attenuations exceeded 15 dB and 30 dB, re- 
spectively. Thus, these levels of attenuation were exceeded for a total 
of 88 minutes of the 6432-hour recording period. Similarly, if one 
examines the 0.001-percent probability level, which for this sample is 
equivalent to about 4 minutes, one finds that the attenuation at 18.5 
GHz exceeded 30 dB whereas at 30.9 GHz, using some extrapolation,* 
the attenuation exceeds 57 dB. 

As stated earlier, the likelihood that the 18.5- or 30.9-GHz attenua- 
tion would have exceeded 30 and 57 dB for the remainder of the year, 
beyond the 6432-hour sample, is remote. If the percentage level is 


* The measuring range at 30.9 GHz is 48 dB. 
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Fig. 3—Percent-of-time distributions of the rain-induced attenuations for both 
18.5 and 30.9 GHz. The solid curves are based on 6432 hours of data, the dashed 
curves are projected yearly distributions. 
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computed on a yearly basis, it is found that the attenuation at 18.5 
GHz exceeds 28 dB 0.001 percent of the time, approximately 5 minutes 
per year, and the attenuation at 30.9 GHz exceeds 54 dB for 5 minutes 
of the year. The yearly attenuation distributions are shown by the 
dashed curves of Fig. 3. 

An interesting comparison can be made between the data points just 
discussed and the attenuation computed*® from the rain rates measured 
on the Holmdel rain gauge network. Figure 4 is a reproduction from 
Ref. 5 of the curves of computed rain attenuations exceeded 0.001 
percent of the year (Fig. 4a) and 0.01 percent of the year (Fig. 4b), 
as a function of path length. These computed curves are based on 
average path rain rates from the 1967 rain gauge network data. The 
current 1970 data at the 0.001-percent probability level for both 18.5 
and 30.9 GHz are shown as crosses on Fig. 4a;* the agreement is quite 


* Figure 4 also shows measurements for the two probability levels at 18.5 and 
30.9 GHz taken on 64- and 1.9-km paths during 1967, 1968, and 1969.2 
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Fig. 4—Attenuation as a function of path length at 11, 18.5, and 30 GHz (1967 
Holmdel rain gauge network data) for (a) 0.001 percent probability and (b) 0.01 
percent probability along with measurements on paths of length 1.9 km (30.9 
GHz), 2.6 km (18.5 and 30.9 GHz), and 6.4 km (18.5 GHz). 


good at 18.5 GHz but at both frequencies the computed curves predict 
a value somewhat larger than the measurements. 

A similar comparison at the 0.01-percent probability level can be 
made if one is willing to assume that no attenuations exceeding 15 
and 30 dB at 18.5 and 30.9 GHz, respectively, occurred for the re- 
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mainder of 1970, beyond the 6482-hour recording period. This is not 
an overly hazardous assumption since experience has shown that the 
tail of the distribution for the measuring period discussed here con- 
tains essentially all the high attenuation data for a year. This as- 
sumption was made with the 1970 data for both 18.5 and 30.9 GHz 
and the results (12.5 and 27 dB) for the 0.01-percent probability level 
are shown by crosses on Fig. 4b. Agreement between computations 
and measured data is fairly good, the curves computed from the rain 
gauge network data again being somewhat conservative. 


IV. RELATIONSHIP BETWEEN THE TWO ATTENUATION DISTRIBUTIONS 


In view of the fact that the 18.5- and 30.9-GHz attenuations are 
measured on a common path, the received signal levels at the two 
frequencies are always attenuated by the same volume of rain and 
therefore fade together. One is justified in attaching significance to the 
ratio of the attenuations at a given percent-of-time level. 

Ratios of 30.9- and 18.5-GHz attenuation for various percentage 
levels (from Fig. 3) are plotted as solid dots in Fig. 5 versus the total 
path attenuation at 18.5 GHz; the theoretical ratiot of 30.9/18.5-GHz 
attenuation obtained using a Laws and Parsons drop-size distribution 
is plotted as the solid curve on this figure. Again, as in Fig. 2, there is 
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Fig. 5—The ratio of measured attenuation from the 30.9-GHz and 18.5-GHz 
distributions of Fig. 3, plotted as solid dots, versus the measured 18.5-GHz total 
path attenuation. The solid curve is the theoretical ratio using Laws and Parsons 
drop-size distributions. 
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good agreement between the ratios obtained from the measured at- 
tenuation and the theoretical distributions based on the Laws and 
Parsons drop-size distribution; thus, it appears that the Laws and 
Parsons distribution holds on the average. The least squares fit to the 
measured ratios has not been plotted in Fig. 5 because of its proximity 
to the computed curve. 


V. CONCLUSION 


Our measurements of the instantaneous ratio of 30.9- to 18.5-GHz 
attenuation appear to be consistent (Fig. 2) with computations based 
on the Laws and Parsons raindrop-size distribution. The attenuation 
ratios have also been derived from the cumulative distributions of 
attenuation (Fig. 3) and these are in very good agreement (Fig. 5) 
with the theoretical computations. Relative values obtained from 
cumulative distributions of attenuation at two frequencies (or more) 
on common paths therefore proves to be a valuable technique for 
probing the rain environment. 
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Optical Modulation at High 


Information Rates 


By GERARD WHITE 
(Manuscript received March 18, 1971) 


This paper describes an optical modulation scheme with a demonstrated 
information rate of 1 Gb/s. The modulation scheme employs an electronic 
multiplexing technique to obtain the high data rate from four 250-Mb/s 
channels. Efficient interfacing between the gas laser beam and the high-speed 
electronic circuits is achieved using a traveling-wave type modulator 
structure. General considerations in optical modulation systems are also 
included. 


I. INTRODUCTION 


One of the major problems in engineering is that of interfacing be- 
tween two differing technologies: This problem is particularly severe 
in laser communications systems where solid state electronics is to 
be interfaced with optics. It is, perhaps, at this technological interface 
that the early hopes of approaching the Shannon-Weaver information 
rates of laser systems have been limited. While electro-optic crystals 
have been shown to have modulation mechanisms operating in excess 
of 10 GHz" and photodetectors with gain-bandwidth products of 
the order of 100 GHz have been produced,? laser communications 
systems, so far, have demonstrated information rates significantly less 
than the capabilities of these components. Significant in the band- 
width limitation of demonstrated laser communications systems have 
been the demands on solid state circuit performance? imposed by the 
optical system and, in particular, the modulator. In this paper an 
optical modulation scheme is described in which system variations 
and subsystem configurations have been adopted in an attempt to 
relax the severe circuit performance requirements so that system in- 
formation rate capabilities approach the bandwidth capabilities of the 
opto-electronic transducers. The system described, using an argon 
laser, has a demonstrated information rate of 1 Gb/s and some 
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techniques for improving upon this rate are proposed. Indicated sys- 
tem noise levels are observed to be low so that it may be possible to 
transmit a multilevel type code which could at least triple the infor- 
mation rate. 

The paper, while describing state of the art results in wide- 
bandwidth optical communications, also provides tutorial matter in 
connection with general considerations in the system design and eval- 
uation. The paper is divided basically into four sections, namely: (7) 
system considerations and optical arrangements, in which it is shown 
how variations in somewhat standard optical configurations can result 
in a high information rate modulation scheme; (iz) interfacing, which 
is mainly concerned with the producing of efficient interaction between 
the electronic circuits and the optical signals with traveling-wave type 
modulators; (227) electronic circuits, which describes how the 1-Gb/s 
binary pulse code modulation (PCM) signals are generated by a 
gating and multiplexing process; and (iv) system performance, with 
general remarks on actual and extrapolated performance data. 


II. SYSTEM CONSIDERATIONS 


Various mechanisms exist for impressing information onto light 
beams including acousto-optic,* magneto-optic,® and electro-optic,°® to 
mention but a few. Applied to laser beams, electro-optic interactions 
have proved most widespread, and in particular, the linear Pockels” 
effect which is exhibited by many dielectric crystals. This effect relies 
on electric-field-induced perturbations of the natural birefringence of 
certain crystals including those in the point group 3m such as lithium 
niobate and lithium tantalate.* Lithium niobate’s susceptibility to 
optically induced inhomogeneities to the natural birefringence’ at 
relatively low optical beam powers makes its use as a modulator 
restricted and, in this application, lithium tantalate has been used. A 
figure of merit for LiTaO, when used as an electro-optic modulator 
is the half-wave voltage; this is the voltage required to induce a phase 
retardance of 180 degrees between components of the beam in the 
ordinary and extraordinary directions. Its value is derived from the 
elements rgg and riz of the electro-optic tensor ;7° 1.e., 


xX c 
ogee ae " 


where n, and n, are the refractive indices experienced by ordinary 
and extraordinary components of the beam respectively. The factor c/a 
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is the inverse aspect ratio of the crystal, c and a being the width and 
length of the crystal respectively. For a value of wavelength equal to 
496.6 nm, the half-wave voltage for LiTaQgz is 2120 volts for the unity 
aspect ratio case. While large aspect ratio modulator rods can be 
prepared, so as to reduce the voltage required for a z-radian phase 
retardance, the value of voltage is still high enough to create severe 
performance demands on modulator driving circuitry in systems which 
rely on an “on-off” type modulator switch. Clearly, a partial modula- 
tion scheme will relax these standards but such a system is not easily 
implemented. It is pertinent at this point to consider some of the 
problems in realizing partial modulation schemes and at the same 
time discuss some of the advantages. 

Consider the basic optical system shown in Fig. 1. It is easy to show 
that the modulated optical power, J,,, at the detector, is related to 
the applied modulator voltage V and the relative phase retardance ¢ 
of the beam components incident at the modulator by 


I, V 
t= [1 — os (6+ 22 +)], (2) 
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Fig. 1—(a) Basic optical modulation scheme. (b) System modulation char- 
acteristics. 
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where J; is the total beam power at the modulator (assuming zero 
losses) and a@ is the retardation introduced by the natural birefringence 
of the crystal. Clearly, maximum beam modulation per unit modula- 
tion voltage is obtained when the x and the y components of the beam 
are equal in amplitude and adjusted for 7/2 retardance at the ana- 
lyzer. At this optimum bias and a crystal aspect ratio of 50 a small 
signal beam modulation depth of 8 percent per volt can be obtained. 
With such an arrangement satisfactory modulation depths of approxi- 
mately 20 to 30 percent can be obtained with just a few volts of 
modulation potential. 

In an actual system in which the modulated beam is detected via 
a photodetector which is coupled into a transmission structure of 
characteristic impedance Z, , the system transfer function (defined as 
the differential ratio of modulation potential to potential appearing 
across the output transmission line) for a quasi small signal mode of 
operation is derived from equation (2) as 


t= EER z, @) 


where q is the electronic charge, h is Planck’s constant, and c is the 
velocity of light. The above relationship, based on the generation of 
one hole-electron pair per photon, assumes zero optical losses and a 
100-percent quantum efficiency in the photodetector and works out to 
be approximately 0.5 per watt of beam power. Clearly, high beam 
powers will enhance the system transfer function coefficient, equation 
(3) ; however, a limit is imposed by optically induced inhomogeneities 
to the birefringence of the crystal. In an experimental system an 
upper limit of approximately 30 mW is realistic which gives a transfer 
function coefficient of 0.015 (not allowing for transmission losses) 
which for a modulation signal of several volts results in a satisfactory 
output amplitude. Satisfactory signal amplitude levels within the 
system, of course, depend on the system noise levels; in this respect, 
the presence of mode competition in a free-running laser system 
results in a very noisy channel. Two solutions to this problem are 
available: mode locking"! and etalon tuning.t? Mode locking effec- 
tively locks the longitudinal modes of the laser to produce a train of 
optical pulses at the frequency separation of the longitudinal modes, 
c/2l, where | is the length of the laser cavity. Optical communica- 
tions systems incorporating mode-locked lasers offer some advantage 
and have been successfully demonstrated.t?** Mode-locked laser sys- 
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tems offer the availability of a stream of very narrow pulses of width 
inversely proportional to the oscillation bandwidth of the laser which 
can result in pulses of just a few picoseconds. 

A disadvantage of the mode-locked system is that optimum modula- 
tion depths and satisfactory base line definition are not compatible 
with low-voltage modulation since, in general, it is desirable in these 
systems to supply a modulation potential which is a substantial frac- 
tion of the half-wave retardation potential. One further disadvantage 
of mode-locked regime in gas lasers is the relatively low repetition 
rates of the optical pulses: This repetition rate is related to the fre- 
quency separation of the longitudinal modes, c/2l, and at short 
resonator spacings (required for high pulse repetition rates) the gain 
of gaseous lasers is insufficient to sustain oscillations. While tech- 
niques exist for effectively increasing this repetition rate by, for 
example, multiplexing’® or harmonic pumping of the mode-locked fre- 
quency,’? neither technique is a well established laboratory practice 
and the implementation of reasonably high rates (i.e., 1 GHz) is not, 
at present, conveniently realizable. 

Relative noise immunity can be obtained with an etalon tuner by 
effectively selecting one pure optical frequency from the Gaussian 
line distribution. This is obtained by deforming the resonant poles of 
the laser so that only one conjugate pair lies within the laser gain 
curve.!® The consequence of this type of operation is that mode com- 
petition is effectively eliminated and beam noise current is observed to 
be less than 5 percent of the total beam current. With such low noise 
powers, low modulation depths can be quite acceptable and a partial 
modulation scheme readily implemented. 

There is no clear optimum in the choice of a particular optical 
frequency as a system carrier. While advantageous reductions in the 
modulator half-wave voltage can be obtained with shorter wavelengths 
[equation (1)], the quantum efficiency of the detector is in general 
reduced at these wavelengths (i.e., below 800 nm for 81). Focusing 
requirements are relaxed somewhat with shorter wavelengths since 
narrower diffraction-limited waists are obtained for a given far-field 
focusing angle. There is not, as yet, sufficient evidence for an optimum 
wavelength from the standpoint of optically induced changes to the 
birefringence. The absorption lines in LiTaO 3 are spaced such that, 
for gaseous laser output frequencies, the crystal is quite transparent 
and the choice of frequency is mainly determined by available powers 
from the various laser systems. In the system under discussion an ion 
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laser (argon) operating at 496 nm has been found convenient. This 
wavelength is somewhat below the characteristic peak in quantum 
efficiency for silicon photodetectors (800 nm), but nevertheless quan- 
tum efficiences of 0.3 A/W can still be obtained. 

Coding for optical communications systems has so far received 
little attention. In the past the use of mode-locked lasers has given 
preference to a binary PCM coding technique. Recent studies!® indi- 
cate that polarization multiplexed, phase shift keying, or pulse position 
modulation may be more appropriate to optical communications sys- 
tems from a transmission standpoint. Problems still remain in imple- 
menting these modulation schemes. However, a frequency modulation 
scheme for space applications has recently been described?° which 
shows promise for potentially wide bandwidths. Because of the ad- 
vantages of regeneration, PCM systems appear attractive and in low- 
noise channels multilevel systems can be considered. The present 
system uses a binary type PCM coding scheme but is readily adapt- 
able to multilevel with little increase in complexity. 


Ili. OPTICAL ARRANGEMENTS 


The optical arrangement used is of a fairly conventional nature 
employing a type of Senarmont compensator** as shown in Fig. 2. 
The argon laser provides a single frequency output of approximately 
30 mW, plane polarized in the vertical direction. It is easy to show 
that a necessary condition for achieving optimum modulation is that 
the beam be circularly polarized at the analyzer plate. The conse- 
quence of a left- or right-hand polarization state is merely to induce 
a 180-degree phase inversion of the detected electrical signal. The 
circularly polarized state can be obtained by ensuring that the ordi- 
nary and extraordinary beam component amplitudes are equal with 

_the necessary relative phase retardance. This is achieved by a 45- 
degree setting of the quarter-wave plate with an incident beam plane 
polarized at an angle 6/2 which is adjusted by the half-wave retarda- 
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Fig. 2—Standard optical system with Senarmont type compensation. 
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tion plate. This particular arrangement can be easily checked by 
reference to the Poincaré-sphere.?? Under these conditions, the x and y 
components of the beam are given by: 


. (cet 7 
B, = v3 A sin (24 8) | 


(4) 
B, = v3 Asin (% + 2) 
hn 2 

The exact angle, 6/2, is selected to compensate for the natural 
birefringence of the LiTaO; crystal. The electric-field-induced per- 
turbations of the natural birefringence of the modulator rod translate 
to perturbations of the relative phase retardation of the x and y com- 
ponents of the exiting beam to produce a modulation of the beam 
ellipticity state. Collimation of the beam is provided to permit distant 
transmission. The analyzer may be placed before or after collimation 
depending, on whether it is desired to transmit amplitude or polariza- 
tion state modulation. At the receiver the beam modulation is con- 
verted to an electrical signal via the silicon avalanche photodetector. 
The avalanching mechanism provides a signal gain of only 2 or 3 due 
to the high optical power levels. In a detector operating at low levels, 
resulting from, for example, atmospheric attenuation of the beam, 
much higher gains should be possible. This is a desirable feature of 
the avalanche detector in that it provides a self-limiting or automatic 
gain control on the signal. 

Efficient focusing of the beam for low-loss transmission through 
the modulator crystal is important. The problem can be dealt with 
in a number of ways and one method is given by Kogelnik and Li** 
in terms of the confocal parameters for optimum clearance. Another 
approach is to consider the percent transmission of power through 
the rod. Consider a rod of circular cross section, as shown in Fig. 3a, 
with an incident Gaussian beam with field distribution given by 


_ —[(22+y?)/w?] 
EH = Eve 7 ee J (5) 


where w is the width of the beam at the 1/e points. The intensity 
distribution, J, is therefore given by 


T a Le eee: (6) 


The total power in the beam, J; , is computed from 


fey hf a ay (7) 
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Fig. 3—(a) Modulator configurations. (b) Diffraction-limited beam waist. 


which reduces to 


Iorw" 


I, _ 9° (8) 


Now, for the case of the circular cross section rod we can compute the 
rod dimension for 99-percent power transmission from 


2 d/2 
0.99( Za") = I, i en?!" Oar dr, (9) 
0 
which, after a parametric substitution, reduces to 
d 
a= 1.5w. (10) 


In a similar manner, the more practical case of a square cross section 
rod (Fig. 3a) can be dealt with, i.e., 
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0.09 Janu!) _ I, i ie Bans dar ai: (11) 


which, again after a parametric substitution, reduces to 


a = 1.37. (12) 


Clearly, the circular structure is more efficient from the standpoint 
of power transmission; however, the practical problem of applying 
uniform transverse sleciic field prohibits its use. 

In matching the optical beam to the modulator rod, most power 
loss, due to dimensional limitations, will take place at the faces of the 
modulator due to beam expansion. Kogelnik has analyzed the expan- 
sion of focused Gaussian beams in some detail.2* Figure 3b shows a 
typical beam waist at the focus point. The equations governing the 
beam parameters in this diffraction-limited case are: 


oN 
Wo = =, ’ (18) 
where 6 is the far-field diffraction angle; 
273 
w(e) = wi] 1 + (2) (14) 
and 
2 
neo = 1+ (52) |, a9 


from which the confocal parameter, b, is obtained as the distance from 
the phase front radii minima as 


Dy. 
TO" 

The beam expansion can be completely specified, for a given wave- 
length, in terms of far-field diffraction angle, 6, which is, of course, 
the focusing cone half angle. For the case of a collimated beam? inci- 
dent on the lens, @ is given by 


b= (16) 


§ = radians, 


4/1 


where w is the beam radius at the lens and f is the lens focal length. 


+The description “collimated,” within the context of Gaussian optics, here 
means a beam wavefront radius much greater than the focal length of the lens. 


2616 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1971 


CONFOCAL PARAMETER, b, IN INCHES 
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Fig. 4~Beam width of a modulator ends as a function of focusing cone angle: 
(a) Ao = 496 nm; (b) Ao = 632.8 nm. 


Equation (14) may be used to calculate the beam radius at faces of 
a rod of length Z for a given 6. This has been done for the case of 
dX = 496 nm and A = 632.8 nm and the results are presented in Figs. 4a 
and b respectively. For each modulator rod length, a clear optimum 
cone angle is indicated. This optimum angle gives the same confocal 
parameter (on the upper abscissa) as Kogelnik’s analysis. The ordinate 
on the right gives the necessary rod cross section for 99-percent optical 
power transmission in accordance with equation (12). Even for the case 
of the 1-inch rod with an aspect ratio of 100, a wide range of focusing 
cone angles are permitted for a high optical power transmission. In 
practical cases, criteria for high-efficiency transmission through the 
rod are easily met and other criteria assume importance. It is interest- 
ing to note that the curves for clearance in Figs. 4a and b coalesce 
for all rod lengths at a cone angle less than 1.5 milliradians. Thus, in 
this case, the transmission of optical power becomes relatively inde- 
pendent of the rod length. 

One other criterion important in beam matching is that of non- 
uniqueness in the beam polarization state across the width of the 
beam. This arises as a result of the differing path lengths experienced 
by the various portions of the beam in the rod. Consider the wavefront 
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diagram shown in Fig. 5a. The optical path length of a particular 
portion of the beam in the modulator can be determined by a count 
of the wavefronts. Clearly, the path lengths are longer for the extrem- 
ities of the beam than for the center. This is of no import in an 
isotropic media but in a crystal exhibiting birefringence, the result 
is a nonunique polarization state across the section of the beam. 
The problem of the propagation of Gaussian beams in anisotropic 
media has been analyzed in some detail?> but it is sufficient here to 
adopt a somewhat simplistic argument to assess the quantitative 
effects of the nonuniqueness. The effect is important since it is at its 
maximum where the wavefront radius is at its minimum, 1.e., when the 
length of the rod is equal to the confocal parameter, b. Consider the 





(a) 





Fig. 5—(a) Wavefront radii at beam waist point. (b) Gaussian beam at. sur- 
face of anisotropic media. 
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Gaussian beam incident at a flat face modulator crystal of length L 
as shown in Fig. 5b. The radius of curvature of the wavefront at the 
face of the crystal is given by 


For the important case of the rod length equal to the confocal param- 
eter, for optimum clearance, equation (17) reduces to 





R(L/2) = . E e (2)'| oy (18) 


A quantitative measurement of the nonunique polarization state of 
the beam can be obtained by determining the differences in relative 
phase retardation between the center and the Hy/e points of the beam. 
The radius of the beam at the face of the crystal is determined from 


v= wfisCeZy}, on 


w(4) = Vz (20) 
vis 
for the confocal case. 


The construction shown in Fig. 6a is useful in determining the non- 
uniqueness of polarization state for the above case. The situation 
is more complicated than that suggested by this simple construction in 
that the radius, R, is dependent on the beam polarization state as is 


which reduces to 


(b) 





Fig. 6—(a) Evaluation of nonuniqueness in beam polarization state. (b) 
Modified modulator cross section to compensate for crystal birefringence. 
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also the beam angle in the crystal. Nevertheless, an approximation to 
the excess distance experienced by the E,/e points of the beam is 
represented by the distance, a. The distance a is given by 


a= R| 1 — (V1 = wR) |, (21) 


which, for w/R < 1 reduces to 


2 
w 
a= oR (22) 
Substituting equations (18) and (22) results in 
A 
ae (23) 


and the relative differential path length, Aa, resulting from this excess 
distance is 


Aa = * (n. — N) (24) 


(the effect is doubled due to the summation of the input and output 
excess distances). For LiTaO; this amounts to only 0.0004 A, a 
negligible phase retardance. For other materials, such as LiNbO or, 
more particularly, deuterated KDP, the effect will be much stronger 
due to the increase in the (n- — n,) factor. The effect is analogous, in 
effect, to a built-in electric field, and where the polarization non- 
uniqueness is degrading, a modified cross section (Fig. 6b) can be 
adopted so as to produce a compensating nonlinear electric field across 
the modulator rod. 

Of more practical significance in matching the beam to the modu- 
lator structure is the peak power intensity produced at the diffraction- 
limited beam waist which can give rise to optically induced inhomo- 
geneities of the natural birefringence.® A field annealing process can 
reduce this susceptability?® so that peak power intensities of 500 
W/cm? are tolerable. The peak power intensity as a function of 
focusing cone angle is given in Fig. 7. This power intensity is ob- 
tained by combining equations (8) and (13) to give 

2 2, 
1, = 222 W/em’, (25) 


0 





where @ is the internal focusing cone half angle. The strong depend- 
ence of power intensity on cone angle dictates a large confocal length, 
consistent with satisfactory clearance. A safety factor on clearance 
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Fig. 7—Peak power intensity as a function of beam focusing angle. 


has been proposed® which, in general, will restrict the minimum cone 
angle to some value less than the theoretical optimum indicated in 
Figs. 4a and b. For modulator rods of 10 x 10 mils cross section an 
internal half cone angle of 1.5 milliradians provides adequate clear- 
ance whilst maintaining satisfactory power intensities with beam 
powers of approximately 30 mW. 

In summary, the matching of the beam to the modulator crystal is 
influenced by three criteria, viz. 


(t) High-efficiency optical power transmission 

(iz) Uniqueness of beam polarization state 

(iz) Limitation of peak optical power intensity at the beam waist. 
Separate optimization of items 7 and 2 give rise to directly conflicting 
matching conditions. Fortunately, the degree of nonuniqueness of 
polarization state is not significant, for LiTaO; , even for the worst 
case condition when the confocal parameter b equals the modulator 
rod length ZL. Further, the optimum exhibited for efficient optical 
power transmission is weak, and a wide range of focusing cone angles 
is allowable for very high efficiencies. Dominant in practical systems 
employing LiTaO; is the peak power intensity which will compromise 
the optimum condition for high power transmission. In cases when it 
is not possible to satisfy item iz and simultaneously maintain high- 
efficiency optical clearance, the crystal may be operated at an ele- 
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vated temperature (107°C) where the effect of optically induced 
birefringence changes is observed to relax at a rate faster than its 
creation.® 


IV. INTERFACING—-MODULATORS 


Efficient interfacing of solid state circuitry with the electro-optic 
modulator is vital for high information capacity optical communica- 
tions systems. In a practical system, the interfacing problem reduces 
to matching a transverse Pockels’ effect crystal (lithium tantalate) 
to the impedance environment of the silicon driving circuits to achieve 
an efficient transfer of information onto the optical beam. A common 
approach in the past has been to regard the modulator as a lumped 
parameter structure which presents a reactive driving point im- 
pedance.2728 The major disadvantage of this approach is that not 
only must electrical charge be injected into the structure, but it must 
also be withdrawn by the driving circuits to restore the modulation 
field to its initial state. To accomplish this charging and discharging, 
at high data rates, impractically large charging and discharging cur- 
rents are necessary. Further, the capacitive load will create a mis- 
match in the driver-modulator transmission path resulting in standing 
waves with consequent modification of the impressed waveform. 

A different approach is to make use of the intrinsic characteristic 
impedance of approximately 55 ohms which is exhibited by LiTaO; . 
This impedance can be realized by rearranging the modulator struc- 
ture to make use of the inherent distributed parameter structure of the 
modulator crystal. The resulting modulator will be of the traveling- 
wave type,?*%° which presents a real driving point impedance so that 
an efficient interface is obtained with the impedance levels of high- 
speed silicon transistor circuits (50 ohms). Previously traveling-wave 
type interactions have been adopted to increase the modulator band- 
width;*! however, in this application their primary use is to provide 
improved impedance matching. The improved bandwidth appears as 
a secondary effect as will be discussed later. 

The exact characteristic impedance of the modulator structure can 
be determined from the differential capacitance and inductance along 
the length of the crystal. The capacitance per unit length is simply 


eae: 4 F/m, (26) 


where b and c are the crystal dimensions along the b and c optical 
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axes respectively. This, of course, assumes zero fringing of the electric 
field outside the crystal which is justifiable since e, >> 1. The inductance 
L due to the two parallel crystal electrodes of length ¢ is” 
2 
n= 1] toe, = log, at 1.5 — log, e + log, k — a ue 27) 
where log, e and log, k are functions of b and c given in Ref. 32. For 


a modulator crystal of 10 mils square cross section, the inductance 
per unit length reduces to 


2 
a = (6.3 — z) nH/cm. (28) 
So that, combining (26) and (28), the characteristic impedance, as a 
function of distance, is 
2 
(03 ~ p) 


€€,0 X 10° 


for c’ < ¢’, i.e., at some distance from the ends of the modulator. The 
exact variation of characteristic impedance along the length of the rod 
is given in Fig. 8. The deviations of Z) at the rod ends are in fact ab- 
stractions, which will not be exhibited by a practical structure due to 
the presence of connections to the electrode end points which results 
in a more complex situation. It is assumed in this calculation that there 
is no dispersion of the dielectric permittivity. Experimental high- 
frequency measurements have indicated that this is the case* and 
further experiments have extended this result to 6 GHz.™* 

The group velocity differences. of the electrical and optical signal 
velocities in the modulator will give rise to a bandwidth limitation. 
This limitation is readily analyzed by consideration of the situation 
depicted in Fig. 9. The voltage experienced. by an optical wave at a 
point z in the modulator rod is 


Z(t) = = 41 ohms (29) 


Vi) = V Sin | ae Ae ie 2) (30) 
Ve Vv, 
where », and », are the velocities of the light and electrical signals 
respectively in the modulator rod. For a rod length, L, the total induced 
phase retardation of the optical beam is, therefore, 


+L/2 aV : ( :,) 
oro = [. VL Sin wl ¢ + x dz, (31) 





OPTICAL MODULATION 2623 


CHARACTERISTIC IMPEDANCE 
IN OHMS 





ie} es 
ie) 100 200 
MODULATOR ROD LENGTH,L, IN MILS 


Fig. 8—Modulator characteristic impedance as a function of distance along 
crystal. 





where 
1 1 1 
yo Ve 7 an (32) 
hence, 
2rVv’ 4. wl «q. 
oto = VoL Sin 57 Sin wt (33) 


which is of the familiar form shown in Fig. 10a. This amplitude 
spectrum results in the Dirac response of Fig. 10b and the electro- 
optic transfer function step function response time is, therefore, 


t, = x4 — 4). (34) 


Ve Ve 





Vg = VSinwt 


Fig. 9—Traveling-wave type modulator structure. 
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Fig. 10—(a) Amplitude spectrum of traveling-wave modulator. (b) Dirac 
response. (c)System response to pulse width 7. 


Typically, for a 1-cm-long modulator crystal a rise time of 150 ps 
is indicated. . 

The modulator bandwidth can be increased with velocity compen- 
sation of the light and electrical signals. Kaminow, et al.,35°¢ and 
Peters** have described some techniques for accomplishing this type 
of compensation. The proposals suffer from some disadvantages in 
practical realizations, including a loss of modulation power, fabrica- 
tion problems in achieving high-efficiency power transmission, and, in 
particular, the latter proposal*’ could result in multimode propagation 
of the electrical modulation signal in the modulator structure. In 
practice, the modulator bandwidth limitation is not significant, with 
respect to system bandwidth, even in high information rate systems 
(1 Gb/s) and it is not necessary to resort to velocity compensation 
techniques. This is easily shown by considering an input pulse, f,(¢), 
of width T where 2T is the system information rate interval, i.e., 


fi) = ALH® — Ht -— 1), : (35) 
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denoting the unit step function as H(t). The system Dirac response, 
fe(t), has been shown to be 


f(t) = B[H() — H(t — R)], (36) 
where 


R= Z(d—n). (37) 
The system response, fo(t), to f,(t) is given by the convolution 
fo(t) = fi(d)*fo(d) 
= [ A[H(2) — H(r — T)|BIH(t — 7) — H(t -—R — 2) dr (38) 


from which it is obvious, from the graphical form of the convolution 
integral, that fo(t¢) is as shown in Fig. 10c. So that, providing R S 7/2 
the modulator bandwidth is not limiting; this is clearly so at informa- 
tion rates of 1 Gb/s. 

It is important to observe from (33) and (34) that a trade off 
exists between modulator bandwidth and half-wave voltages so that 
it is possible to employ velocity compensation to permit longer 
crystals (while maintaining the same bandwidth as the uncompensated 
structure) to achieve advantageous reduction in the half-wave voltage. 
This trade off is also possible with different modulator materials. In 
fact, a gain-bandwidth product can be assigned to a particular modu- 
lator material. With this figure of merit, the gain is related to the 
inverse half-wave retardation voltage and the bandwidth is related 
to the optical refractive index and the microwave dielectric permit- 
tivity, i.e., 


eee Seer 
~ (n— &)V, 
This figure of merit is given for a number of modulator materials 
in Table I. The situation is more complex, however, than is suggested 
by a numeric value of the gain-bandwidth product: GaAs, for ex- 
ample, only exhibits below bandgap transmission for » > 900 nm 
and GaP incorporates a dielectric relaxation time which can be as 
large as 10 ps.*® Further, the mechanical properties of the materials 
dictate the feasibility of fabrication of large aspect ratio crystals to 
effect a gain-bandwidth trade off. Other factors, including free carrier 
optical absorption and optically induced crystal damage, further 
complicate the choice of material. 


GB (39) 
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TABLE I— Figure oF MERIT FoR DIFFERENT MopuLATorR MATERIALS 























Material Type n(av) ér V,t | GB(V- x 1075) 
LiTaO; Ferroelectric Perovskite | 2.18 43 2130 10.7 
LiNbO; Ferroelectric Perovskite 2.25 30 2340 13.3 
KH2PO, KDP?0 1.5 21 4350 7.5 
GaAs AB-Type Semiconduc- 

tor 3.5 12.5 3870 0 
GaP AB-Type Semiconduc- 
tor 3.4539 | 11.138 | 7830 0 





t Extrapolated to \ = 496 nm. 


LiTaO; represents a compromise of the gain-bandwidth product and 
the other factors while still providing significant advantages in re- 
sponse time when incorporated in a traveling-wave structure compared 
to the lumped parameter configuration. Equation (34) can be used to 
compare the performance of the traveling-wave LiTaO; modulator 
with the electro-optic transfer function rise times obtainable with 
lumped parameter configurations operating in the normal 50-ohm 
environment. The rise times are, of course, a function of modulator 
rod length as is seen in Fig. 11. Even for the double-pass-type lumped 
parameter structure,* the response times of the uncompensated 
traveling-wave structure are significantly superior. Typically, for a 
modulator rod length of 500 mils, the rise times are 150 ps, 250 ps, and 
500 ps for the traveling-wave, double-pass lumped parameter, and 
single-pass lumped parameter structures respectively. 

The traveling-wave modulator structure has been realized in micro- 
strip thin film form on an alumina (Al,03) substrate as shown in Fig. 
12. The alumina substrate has microstrip transmission lines of 50 
ohms characteristic impedance etched in the evaporated chrome-gold 
layers with the reverse side metallization removed below the modulator 
rod. Connections are brought through the substrate from the reverse 
side metalization to make contact with the upper contact lands. The 
modulator crystal has evaporated electrodes, typically chrome and 
palladium, on opposite faces along the longitudinal axis. Placement 
of the modulator crystal, with the 6 optical axis perpendicular to the 
surface of the substrate, is facilitated with low-temperature solder 
which also provides acoustic damping.** The electrical modulation 
signal is applied via the coaxial-to-strip-line transition, the strip line, 
and the rod, and exits from the substrate to be absorbed in a matching 
termination. The possibility exists, however, for reapplying the elec- 
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trical modulation signal to further cascaded modulator structures 
without the need for any additional electrical power. In this manner, 
not only can increased modulation depths be obtained but some degree 
of compensation can be obtained for the electrical and optical 
velocity differences by adjusting the relative electrical and optical 
delays between the two modulators as shown in Fig. 13. 

A photograph of an actual modulator structure with a 500-mil- 
length rod is shown in Fig. 14. The time domain reflectometry (TDR) 
trace of the characteristic impedance throughout the modulator is 
presented in Fig. 15a. Small perturbations in characteristic impedance 
due to the coaxial-to-strip-line transitions are observed with a much 
larger deviation from the 50-ohm impedance level along the length of 
the crystal. The crystal impedance level (~ 30 ohms) is less than 
indicated by equation (29) and is partially attributable to the electric 
field deformation due to the close proximity of the high permittivity 
(e, = 9.7) alumina substrate. The time resolution of the TDR system 
is 30 ps and the relative slowness of the drop to the 30-ohm level 
indicates the possibility of a dielectric relaxation mechanism in the 
crystal. The impedance mismatch results in a reflection coefficient, p, 
of 0.25 or a VSWR of only 1.65, and no significant deformations of 
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LUMPED ELEMENT, 
SINGLE PASS 


(bo) 
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Fig. 11—Rise time as a function of modulator rod length for traveling-wave 
and lumped parameter single- and double-pass systems. 
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Fig. 12—Traveling-wave modulator embedded in microstrip transmission lines. 





Fig. 13—Cascaded traveling-wave modulator structure. 
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Fig. 15—(a) TDR characteristics for LiTaO: modulator (100 ps/em). (b) Elec- 
trical pulse transmission through modulator (100 ps/em). (c) TDR characteristics 
for LiNbOs modulator (200 ps/em). 
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the modulation waveforms are observed in the actual system as a 
result of the nonunity VSWR. The high-frequency attenuation of 
the electrical signal after traversing the modulator structure is illu- 
strated in the time domain response of Fig. 15b. The rise time is 
significantly less than 100 ps indicating an electrical band pass in 
excess of 4 GHz. 

Lithium niobate, with a lower permittivity, produces a character- 
istic more nearly approximating the desired 50-ohm impedance level 
as indicated in the TDR trace of Fig. 15c. This is for a 360-mil-length 
rod with a 10 X 10-mil cross section mounted in a similar traveling- 
wave structure. 

Modulator rods of 60:1 aspect ratios (10 x 10 x 600 mils) have 
been fabricated without any noticeable evidence of stress induced 
inhomogeneities to the birefringence and the actual system perform- 
ance to be described employs this length of modulator crystal. 

No attempt has been made to compensate for the strong tempera- 
ture dependence of the electro-optic effect in LiTaO3. Small signal 
modulation reduces this problem since RF heating is negligible. 
Optical power absorption and ambient temperature changes can be 
significant and in a practical system the cascaded, temperature com- 
pensated structure of Peters*?*? can be employed. This scheme has 
recently been realized in a 10 X 10-mil cross section crystal in a 
traveling-wave type structure.** 


V. HIGH-SPEED ELECTRONIC DRIVING CIRCUITS 


At present, a major factor in restricting the information capacity 
of optical communications systems is the speed of operation of the 
electronic driving circuits. It is in this area that effort will result in 
information rate increases. The present system uses an electronic 
multiplexing and gating process to achieve the 1-Gb/s PCM word 
patterns necessary to produce the overall system information capacity. 
Optical multiplexing’® has been proposed as a means of increasing 
system information capacity and it is possible that this multiplexing 
technology could complement the electronic multiplexing described 
here. Indeed, such a combination may be preferable to a wholly optical 
multiplexing scheme since the complexity of the optical arrangements 
would be reduced. 

The general schema of the electronic PCM system is shown in Fig. 
16. Here basic system timing is provided by a master 250-MHz sine- 
wave oscillator. After division by a quarter-wave transformer to 
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Fig. 16—Schema of electronic PCM system. 


achieve impedance matching, the waveform provides system synchro- 
nization and drive to four Boff (snap-off) diode subnanosecond spike 
generators. Amplification and variable delay units are provided for 
adjustment of the phase and amplitude of the four channels. The 
amplifiers are standard class C tuned stages providing approximately 
8 db of gain and the variable delay units are standard variable-length 
coaxial lines. The timing of the four waveforms is arranged so that 
the spike pulses fall in sequence into each of four time slots or win- 
dows, 1 ns wide. In this manner the pulses can be multiplexed in an 
or gate multiplexer to produce a bit rate of 1 Gb/s. Information is 
impressed onto the pulse trains in the individual channels in “informa- 
tion AND gates.”” With this technique the information signals can be 
of a relatively slow nature (i.e., rise and fall times of 2 ns) at a bit 
rate of 250 Mb/s. Logic processing prior to impressing onto the spike 
trains is already available in standard emitter-coupled integrated 
circuit form. 

The realization of some of the constituent parts of this system repre- 
sents some difficult design and fabrication problems and it is pertinent 
to discuss in some detail these subsystem elements. 


5.1 Boff Diode Spike Generators 

The high-speed subnanosecond spikes are generated in a tuned cir- 
cuit excited by the abrupt cessation of carrier storage in the Boff diode. 
The generator is of a conventional shunt-mode type which is well 
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documented.*#° The actual circuit form is shown in Fig. 17a. The . 
width of the pulse is determined by the half period of the resonant 
circuit formed by ZL; and C;. £,, and C,, provide matching to the 
sine-wave source whilst R, sets the dc bias. 

Amplitudes of approximately 10 volts are obtained with pulse base 
widths of 700 ps. Significantly narrower pulse widths are easily 
obtainable. with the promise of higher orders of multiplexing and 
concomitantly higher bit rates. The prime limiting factor to this type 
of extrapolation, however, is the transmission bandwidth of the infor- 
mation gate. A typical spike generator output waveform is shown in 
Fig. 17b. Figure 17c shows the output waveform of the four channels 
with the pulses positioned in their relative 1-ns time slots. The circuit 
is realized in conventional strip line construction with probed Patani 
eter reactances. 


5.2 Information Gates 

The function of the information gate is to transmit or inhibit indi- 
vidual spike waveforms at a bit rate of 250 Mb/s in response to a 
“1” or a “O” on the information signal. High-speed considerations 
would normally suggest a passive gate employing Schottky barrier 
diodes; however, for the case of equal input and output impedance 
it is not possible to obtain a passive gate gain in excess of 0.5. In 
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Fig. 17—(a) Boff diode spike generator. (b) Single-channel output waveform 
(1 ns/em, 2 V/em). (c) Four-channel output. in relative time slots (2 ns/cm, 


5 V/cm). 
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contrast, active gates can give gains in excess of unity although a 
more limiting bandwidth is imposed. The bandwidth limitation can 
be minimized to acceptable levels with the current routing pair con- 
figuration which has been used in this system. The simple circuit form 
is shown in Fig. 18a. The actual circuit incorporates a number of de 
restoration networks so that little demand is made on the entropy of 
the information signal. 

The high-speed spikes with frequency components in excess of 1 
GHz are applied to the common emitter node via a terminating 
resistance resulting in current pulses with amplitudes of the order 
100-200 mA. Current routing between transistors Q,; and Qe is con- 
trolled by the base potential of Q; which is determined by the ampli- 
tude of the information signal. With this configuration Q». operates 
in essentially the common-base mode so that the full common-base 
current gain cutoff frequency of the device can be realized. In contrast, 
the switching action in response to the information pulses takes place 
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IN 
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Fig. 18—(a) Information gate. (b) Information gate switching: top trace, 
output; bottom trace, information input (2 ns/em, 2 V/cm). 
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at the base where switching speed is commensurate with the rise times 
_ of the information signal. 

As a result of the large-amplitude currents being switched in the 
devices, high-level injection effects occur. The problem then is that 
phenomena, such as emitter crowding, transverse voltage drops*® in 
the base, and the Kirk effect,‘7 produce device parameter degradation. 
These high-level injection effects are still significant in the nine-stripe 
geometry devices used. Device parameter degradation has two effects, 
first, a bandwidth limitation which in turn produces a reduction in 
output pulse amplitude, and second, emitter-to-base voltages which 
become quite large so that complete current switching from one de- 
vice to the other is not possible. A consequence of the latter effect is 
that it is not feasible to switch off Q; at any time (due to reverse 
emitter-base junction breakdown of Qe), and Qe is only switched off 
in response to large (5 V) pulses at the base of Q,. This degenerate 
switching mode of operation is satisfactory however, since only the 
output of Qe is utilized. The overall capabilities of the information 
gate are a switching time of less than 2 ns with transmission band- 
width in excess of 1 GHz for output pulse amplitudes of 7 volts. 
Typical switching action is indicated in Fig. 18b. Here, information 
impression on the spike pulse train is shown (upper trace) for a 10101 
word pattern, in response to the relatively slow information signal 
(lower trace). It is to be noted that the base width of the exiting 
word pattern pulses is still significantly less than 1 ns so four-channel 
multiplexing is still efficient. The circuit is realized in thin film micro- 
strip with thin film chrome resistors and beam leaded transistors. The 
actual structure is shown in Fig. 19. The devices used had a cutoff 
frequency of approximately 4 GHz and a collector-base capacitance 
of approximately 0.3 pF. 


5.3 1-Gb/s Multiplexing Gate 

Multiplexing of the four information bearing pulse trains can be 
accomplished with a four-input ‘“‘or” gate. “or” gates, unlike “anp” 
gates, can have unity gain with passive components. The use of pas- 
sive components relaxes somewhat the high-speed circuit requirements 
for the multiplexer. In particular, Schottky barrier diodes can be used 
to advantage for the multiplexer. The low carrier storage time of 
these devices permits the high-speed circuit performance required. 
The actual circuit used is a simple four-diode structure -in microstrip 
as shown in Figs. 20a and b. This particular hybrid construction used 
leadless inverted device (LID) packaging of the devices. 
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P Fig. 19—Information gate realization in thin film microstrip with beam lead 
evices. 


A typical set of input waveforms to the multiplexing gate is shown 
in Fig. 21a. Here, the four channels are positioned in their relative 
l-ns-wide time slots with a 10101 word pattern on channel 1. The 
multiplexed, 1-Gb/s output is shown in Fig. 21b. Little degradation 
in wave form speed is apparent. The speed limiting factors are 
Schottky barrier diode storage time and junction capacitance. The 
former can be significantly less than 100 ps** and the latter can be a 
fraction of a picofarad.*® In cases where a large number of channels 
are multiplexed, the junction capacitance can become the dominant 
limiting factor since it is a cumulative effect. Diode barrier potential 
and differential “on” resistance result in some amplitude attenuation 
which attenuates output pulses to 5 or 6 volts. 

The output from the multiplexing gate is applied, via a coaxial line, 
to the LiTaOs; traveling-wave modulator. 


VI. OTHER REALIZATIONS 


Other electronic realizations of the circuit functions described are 
possible which should provide improved performance at higher data 
rates with higher modulation depths. An example of an alternative 
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Fig. 20—(a) Multiplexing gate “or” circuit. (b) Multiplexing gate realization 
in thin film microstrip. 







CHANNEL 4 
CHANNEL 3 fe 


CHANNEL 2 §@ 


CHANNEL 1 gg 





(a) “ (b) 


Fig. 2i—(a) Input waveforms to multiplexing gate (2 ns/em, 5 V/cm). 
(b) Upper trace: 1-Gb/s multiplexed output; lower trace: channel 1 input 
word pattern (2 ns/cm, 2 V/cm). 
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realization is shown in Fig. 22. In this realization the functions of 
the information gates and the multiplexer are combined by taking 
advantage of inherent multiplexing capabilities of the collector output 
circuits. In this manner, attenuation resulting from the multiplexer 
diode barrier potential and differential ‘‘on” resistance is eliminated. 
High-level injection effects in the information gate devices can be 
reduced by cascaded structures so that significantly higher amplitude 
modulation waveforms can be obtained. Such realizations could lead 
to “on-off” systems with very high data rates (in excess of 1 Gb/s). 
The limiting factor in these realizations is the cumulative addition of 
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Fig. 22—Alternative realization of information gates and multiplexer functions. 
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collector output capacitance but present-day devices are available 
with values of capacitance significantly below a picofarad and time 
constants of less than 100 ps should easily be obtained. 

With the previously described circuit elements, multilevel coding 
is readily implemented. This form of coding is achieved merely by 
supplying spike waveforms with the necessary quantization in ampli- 
tude levels. The system depicted in Fig. 23 is such a realization for a 
four-level coding scheme. Not shown is the conversion logic which is 
required at the inputs to the information gates but since this is at a 
relatively slow data rate (250 Mb/s) it is easily accomplished with 
standard integrated circuits. Multilevel PCM coding schemes are 
indeed attractive in view of the low system noise as will be discussed 
later. 
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Fig. 23—Multilevel PCM coding schema. 
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VII. SYSTEM PERFORMANCE 


The complete system was established in accordance with the basic 
schemas of Figs. 2 and 16. Optical loss and beam parameters are 
important factors in such systems where efficient use of beam power 
and optical component matching are required. Losses through passive 
optical components are relatively unimportant (typically 1 percent or 
less per surface). However, modulator losses are significant because 
of optical absorption and beam blockage due to large aspect ratio 
modulators. To achieve efficient optical matching, exact knowledge 
of the beam parameters is required; this was obtained by two experi- 
mental techniques, viz., slit measurements and edge measurements 
similar to those of Arnaud, et al.°° Both techniques made possible 
reliable estimates of the beam intensity profile. Typical of these 
measurements is the experimentally obtained diffraction-limited waist 
pattern shown in Fig. 24 which compares well with the calculated 
waist pattern [from equation (14)] also presented. Such exact knowl- 
edge of beam parameters enabled modulator losses as low as 0.5 dB to 
be obtained. In practice, however, low-loss transmission does not co- 
incide with high-efficiency modulation depths but, nevertheless, losses 
as low as a few decibels are obtainable. 

Practically realizable system transfer function coefficients of ap- 
proximately 0.17/W were obtained as opposed to 0.5/W predicted 
by equation (3). This lower value is accounted for primarily by the 
nonlinear modulation mode. Other factors, including reflection from 
the exiting face of the modulator crystal and analyzer losses, also 
reduce the coefficient. Nevertheless, output voltages of 20 mV result 
from applied modulation potentials of 4 V. 

The total system performance at a bit rate of 1 Gb/s is illustrated 
in Fig. 25a. This output from the photodetector (lower trace) results 
from the impression of a 101101 word pattern on channel 1, channels 
2 through 4 having all 1’s impressed. This particular result is pre- 
sented as typical of the many different word patterns that have been 
successfully transmitted over the laser beam. The detected electrical 
output from the photodetector after transmission via the beam is to 
be compared with the electrical modulation waveform exiting from 
the modulator structure (upper trace). Two points are to be noted, one 
the fidelity, and two, the low noise. The first point is indicative of 
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Fig. 24—Experimental and computed beam profiles at diffraction-limited waist. 


the exceptionally wide bandwidth of the system while the latter 
augurs well for a multilevel type of coding. 

The bandwidth of the system is mainly determined by the length 
of the modulator crystal and a length of 0.6 inch results in a fre- 
quency axis crossing point of approximately 5 GHz. This conse- 
quentially produces, in the time domain, a step response rise time of 
approximately 200 ps. Figure 25b is an experimentally observed re- 
sponse for a 250-ps rise time pulse excitation (upper trace) ; the de- 
tected pulse (lower trace) exhibits a rise time of approximately 400 
ps which is commensurate with a system response time of 200 ps. 
The system bandwidth, of course, can be made arbitrarily large, up 
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Fig. 25—(a) Upper trace: electrical modulation signal exiting from modulator 
(2 ns/em, 2 V/cm); lower trace: detected optical signal after transmission over 
laser beam (2 ns/em, 10 mV/cm). (b) Upper trace: applied electrical modulation 
waveform; lower trace: detected optical signal after transmission over laser beam 
(500 ps/em). 
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to the limit of the photodetector, by reducing the length of the modu- 
lator crystal. This increase in bandwidth is paid for at the expense 
of smaller depths of modulation. 

Some small deviations in waveform shape are observable between 
that exiting from the modulator crystal (Fig. 25a upper trace) and 
that shown in Fig. 21b for which there is a pure resistive driving point 
impedance. This is attributable to the slight impedance mismatch 
between the modulator structure and the transmission line structure 
in which it is embedded. Its primary cause is reflections in the 
traveling-wave modulator rather than coupling paths to the electronic 
circuits since no significant reflections are observed in this path for test 
waveforms. The effect, however, is not great and the overall result is 
that of widening the PCM eye pattern, which is a desirable feature. 

The noise level observed in the waveform of Fig. 25a is attributable 
to a combination of the noise in the photodetector, that due to the 
laser beam, and that due to the sampling noise in the instrument 
used to observe the waveform. The latter source, which is intrinsically 
unrelated to the actual system, provides the dominant contribution. 
Nevertheless, taking the signal-to-noise ratio as the observed 20 dB, 
results in a possible system information rate in excess of 3 Gb/s in 
accordance with the Shannon-Hartley law. This information rate 
would require, of course, a more efficient coding scheme. 

Long-term stability of the system is measured in hours. Loss of 
stability is due to the onset of mode competition resulting from 
etalon adjustment wandering, although it is not difficult to envisage 
a correction mechanism via a monitoring and servomechanism scheme 
to extend this time to arbitrarily long periods. The modulator crystal 
is not temperature controlled and ambient temperature variations will 
cause a loss of system bias point. As discussed earlier, this is readily 
rectified using a cascaded rod structure to achieve bias stabilization. 

Once detected, the received signal may be demultiplexed conveniently 
by existing techniques. A suitable optical technique has been described 
by Kinsel®! for demultiplexing two channels and a related technique 
for an arbitrarily large number of channels has been proposed by 
Chen.*? Electronic techniques employing a sampling scheme are also 
applicable without placing severe constraints on the demultiplexer 
circuit. This relaxation in requirements results from the relatively 
small amplitudes of the detected waveform. In repeater applications, 
the demultiplexed pulses can be conveniently amplified in relatively 
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low-frequency amplifiers (250 Mb/s) in order to provide drive sig- 
nals to subsequent information gates. 


VIII. CONCLUSIONS 


An optical communications system has been described which has a 
demonstrated information capacity of 1 Gb/s. The major difficulties 
to achieving high information rates have been outlined as lying in 
high-speed circuit realization and the interfacing of high-speed 
electronic circuits and optics. These difficulties have been alleviated by 
a system architecture and subsystem design which not only results 
in the high information rate described but shows promise for future 
systems of even higher rates. These higher rates could be achieved 
by a multiplexing of quantized electrical pulse amplitudes, and the 
possibility also exists for a further increase in rate with optical multi- 
plexing. 

The realization of the high-speed performance of the electronic cir- 
cuits relies not only on the design configuration but heavily on the 
fabrication technology employed. In the case of the 1-Gb/s system a 
beam lead chip on alumina substrate technology has provided suffi- 
ciently low parasitic reactance values to achieve the desired speed 
performance. In a wholly electronic communications system, the cir- 
cuits evolved in this study could find advantageous application in 
achieving data rates which are closer to the most efficient bandwidth 
utilization rate for coaxial systems.®* 

Transmission channels have not been considered in this paper; these 
channels could be of an open atmospheric, gas lens guide, or fiber optic 
type. Recent advances in low-loss fibers®*** make these look par- 
ticularly attractive for optical transmission and efficient utilization 
of these fibers could call for wide bandwidth schemes of the type 
described. 

Practical laser communications systems already are in use!*** and 
the exponential growth in bandwidth requirements will undoubtedly 
force a greater exploitation of the laser portions of the electromagnetic 
spectrum. Even the high information rates described nowhere near 
approach the theoretical bandwidth capabilities of optical communi- 
cations systems. This ultimate limit will probably only be realized 
with improved optical processing of logic so that the potential of 
the extremely short pulses now available from such lasers as the 
neodymium YAG and the solid state GaAs forms can be realized. 
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Analysis of Dependence Effects in 
Telephone Trunking Networks 


By J. M. HOLTZMAN 
(Manuscript received March 10, 1971) 


Since theoretical and computational difficulties often preclude exact 
solution of telephone trunking network problems, approximate methods 
are naturally used. A typical approach is to determine link* blocking 
probabilities and from them calculate point-to-point blocking probabilities 
by wnvoking independence assumptions. Although the link blocking prob- 
abilities may be quite accurate, the point-to-point blocking probability 
calculations will, in some cases, suffer from the independence assumptions. 
This paper presents a method of taking dependence into account for certain 
networks by approximating conditional probabilities which reflect the 
dependence. The approximations avoid the problems of dealing with the 
large sums associated with this problem. 


I, INTRODUCTION 


The exact analysis of telephone trunking networks often leads to severe 
computational problems due, e.g., to the large number of possible states. 
Approximate methods are thus naturally used. There has been much 
success In approximately calculating link blocking probabilities but less 
in determining point-to-point blocking probabilities. Errors in the point- 
to-point blocking probabilities can be caused by midependence as- 
sumptions. 

The purpose of this paper is to take advantage of existing techniques 
for approximating link blocking probabilities (which are quite accurate) 
and develop an approach for taking link dependences into account. In 
particular, we present a method for approximating the appropriate 
conditional probabilities for cases where the traffics are Poisson (or close 
to Poisson). The extension of the approach to the case of distinctly non- 
Poisson processes, such as arise in overflows, will be reported in Ref. 1. 
To obtain point-point blocking probabilities for non-Poisson processes, 


* We use link to denote trunk group. 
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one must also solve for the blocking seen by individual traffies when 
more than one traffic is offered to a link (the equivalent random method 
gives the blocking seen by the combined traffic). This is also treated in 
Ref. 1. . 

To clarify the role of independence assumptions, suppose. there is 
common traffic on links 1 and 2. Let A, and A, be the events that the. 
common traffic is blocked on links 1 and 2, respectively. Then the 
probability of being blocked on either link 1 or on link 2, P{A,\U A,} = 
P{A,} + P{A,} — P{A, CQ Ao}, is often approximated by P{A,U Ap} . 
= P{A,} + P{A.} — P{A,}P{A,.} oreven by P{A,U Ap} & P{A,} + 
P{A,}. The last approximation is clearly accurate if P{A,} + P{A.}'> 
P{A,(C\ Az} and the first is accurate if P{A, | A.} & P{A,} or if both 
P{A,( Az} and P{A,}P{A,} are relatively small. On the other hand, 
it is easy to give examples where the neglect of dependence leads to 
non-negligible errors. | 

The errors due to neglecting dependence naturally depend strongly. 
on mutual traffic. To see this in a transparent case, let us reconsider 
the two-link network mentioned above with each link having. the same 
number of trunks. P{A,|A2} could vary from P{A,} (when the 
mutual traffic is zero) to unity (when there is no traffic on link 2 not 
shared with link 1). In the latter extreme case, assuming independence 
would give P{A,} + P{A.} (1 — P{A,}) for P{A, U A.} compared 
to the correct answer P{A,}. Thus, assuming independence could, 
in this case, conceivably overestimate the point-to-point blocking 
probabilities by something approaching 100 percent. In fact, ‘it is 
possible to overestimate a point-to-point blocking probability on m 
tandem links by almost as much as m times by assuming independence. 


II. BASIC APPROACH 


For simplicity of explanation, first consider the situation shown in 
Fig. 1. 4; 2, and Ay2 are the parameters of mutually independent 
Poisson processes. Holding times are mutually independent exponential 
random variables with unity mean (or the mean is the time unit) 
here and throughout this paper. Also, throughout the paper we shall 
assume that lost calls are cleared and that the system is in equilibrium.. 
Links 1 and 2 have N, and N; trunks, respectively. Assume N, = Ny. 

Let A, and A, denote the events of link 1 and link 2 being blocked, 
respectively (i.e., there are N, calls up on link 1 and N, calls up on 
link 2). _ 

All of the equilibrium state probabilities may actually be expressed in 
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closed form using, e.g., the form of solution given in Ref. 2, Section 7. 
For example, 


N ito Noi 5. Ne . 
2 i" r7ti2 x 2—1t12 pe 
: ot: pt Ae. 
=0 t12! =o by! =o 22! 
Pi a ee ee ee (1) 
1 ur) G2? 
1 2 12 
ittimsn, ti! te! to! 
tattieSNeg 





where the indices are nonnegative integers (throughout the paper). 
However, the use of such exact results very quickly becomes impractical 
for all but the smallest problems even for large computers because of the 
number of computations required. 

The link blocking probabilities can usually be quite well approximated. 
For example, the following reduced load equations, 


P, = BIN, ,. + An(l — P.)), (2) 
P, = B(Nz, 2 + Ma(l — P,)), (3) 
PIAL ESP.. (4) 
PlA= P,. (5) 


(B(N, 2) is the Erlang B formula) are often sufficiently accurate as they 
stand.* See Ref. 3 for much more discussion of link blocking probabilities. 

Once the link blocking probabilities are determined (however they 
are determined) all that is left to calculate is P{A, | A2} in order to 
determine P{A, U A,}. We first write down this probability exactly and 
then show a simple practical approximation. (Approximating the sums 
becomes even more important when considering more complicated 
dependences as in Section III.) To this end, note that 


No te Ne-iie N27 F212 
12 2 1 


>» diol (No — in)! (Ni — tia)! 


Ne tie Neortie Ni-iie Pt 
12 2 1 


412=0 tie! (N2 == 12)! 1,=0 1,! 


N Ni-t 
: Na) t Ne-i 1" 
_ EC (Vi — 7)! 6) 


Na INN & > Ni-i yi1 
> ( “lp q’? > t 


i=o \ @ Azo 11! 


P{A, | Ao} = 





* It may easily be shown that these P,; and P: agree exactly with P{A,} and 
P {Ae} to first-order terms in Ay. Although the P; usually overestimate the P{A;}, 
they can underestimate (seriously, in extreme cases) when A, is very large. 


2650 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1971 


N 
Ny No 
ene) 
PN r 


Fig. 1—Two-link network. 


where 
= Are 
Pp x Mis + No ? (7) 
gq=1-—p. (8) 


Observe that (6) is the ratio of two sums each of which is essentially 
a Bernstein polynomial (see Ref. 4). However, a Bernstein polynomial 
is of the form >>"_, (")p‘q” ‘f(é/n) but our f is actually a function of 7 
or of n(i/n). Each sum is also the expectation of a function of a bi- 
nomially distributed random variable. Using the following interpolation 
formula (see Ref. 5, p. 178), 


f) =f/@ + @— Dafa + 1) — fe — 1)] 
+ —7*3f@ +1) — 2f@— + /@—- 1), 9) 


and taking expectations yields 
ps ("oto 40 = f(np) + on [np + 1) — 2f(mp) + f(np — 1)]. (10) 


(10) is the usual expression for the mean and variance approximation to 
an expectation of a function of a random variable but with the second 
derivative replaced by a central difference. Using (10) on the numerator 
and denominator of (6) yields 


P{A, | Ao} & P.{A, | As} 


Napa | Mt = N, = ve | 
Se 
awe tt le = | 
B(N, — Nop, »,) 2 N, — Nop +1 


Interpolation, such as given on page 571 of Ref. 3, may be used to 
evaluate the Erlang B formula for a nonintegral number of trunks or an 
approximation for Erlang B in integral form can be used. 

Note that we use the same type of approximation on the numerator 


DEPENDENCE EFFECTS 2651 


and denominator, which are closely related, so that the ratio can be more 
accurate than either taken separately. 

(11) shows quantitatively what we expect qualitatively, namely, that 
the dependence effect gets small as the common traffic gets small. In 
particular, when \i2 = 0, P{A, | Ao} = B(N,, 1) = P{Ai} and when 
Ne = 0, P{A, | A.} = B(N, — No, d1) (recall that we assumed N, 2 N2). 

To examine the approximation, we find it convenient to first consider 
P,{A{|A2} where superscript c denotes complement. Actually, since 
P{A, U Az} = P{A,} + P{A{| A.}P{A.}, P{A{ | A2} is really the 
crucial quantity. With VN, = N., we have 





PiJAti Az} = 1 = Pi A, | A,} 


a Napa = 200, = Np) 
_ BW. — Np, ) 1+ 1 XY 











1 + Nepal Mi -2| 
B(N, — Nop, ») 2 N, —Nop+1 


(12) 
If we let N, = N, = N, then 
a oe Nopq E 2a 
1+ <4 
c ) 1 Ay 
© | As} _BWa x) * 7 2 Ld (13) 


a at —~ -} Npq Ey = 2| 
BWq, »:) 2 LNg 1 
(12) could have been derived directly using a mean and second central 
difference approximation just as (6) was approximated by (11). The 
terms multiplied by N.pq/2 in both the numerator and denominator 
are then the second central difference terms. If we hold \, and p fixed 
and let N get large, it can be shown that the second central difference 
terms get small compared to the mean terms sugg esting that the ap- 
proximation is accurate for large N. However, since P,{A, | A2} > 0 as 
N — (with i, and p fixed), investigation of this type ee is 
of limited practical value. 

It is probably of more interest to examine the approximation for 
fixed p and to let \, get large with N. To this end, let \, = AN with 
k > q. For sufficiently large n anda > n, B(n, a) = 1 — n/aso that 


B(Ng, kN) Y1— = (14) 


Hence, we obtain for large JN, 
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1+ Meal kN — 24 Nal 





9 |No+1 kN 
a a a kN 
1, Mel i | 
1-4 q+ 
k 
& B(Ng, kN) (15) 


which has the interesting interpretation that with link 2 full, the number 
of trunks on link 1 to handle the \, traffic is reduced by the average 
number of calls on link 2 which are common to link 1 (we shall elaborate 
on this below). Although (15) is intuitively appealing and obviously 
correct for p = 0 or 1, its accuracy should be examined for 7p e (0, 1). 
Observe that for large N, the variance terms dominate. 

By elaborating on the interpretation alluded to above, we can see 
why B(Ngq, i) should tend to overestimate P{A, | A.}. With D; the 
event that 7 calls from the },, traffic are up, 


D; = {tx calls up}, (16) 
observe that 
Na 
Ne 
= » BN, — 1, s)P{D; | A>}. (18) 


7=0 


Now, 7f the 12 calls were never blocked on link 1, then P{D; | A.} = 
(“)p'q’’. With N, = N. = N, , = KN, it may be shown that 

to BIN: — 7, uu) @)p'g"* > BOM: — Nop, i) = B(NG,d) asN > &. 
B(N, — Nop, »,) is the blocking probability on link 1 when the number 
of trunks is reduced by the conditional mean of trunks occupied by 
iw Calls. Hence, under these conditions, B(Nq, \,) approximates 
P{A, | A.2} by ignoring the blocking of \,2 calls on link 1. If we take 
this blocking into account, we would expect that for large N, P{A,| A} 
< B(Nq, 1). This will be seen to be the case below. 

Another approximation for P{A, | A} is 


rq A pry e/V mB, 
req + pre) e8"?/(L = 12) V 2B 





P,{A, | As} = (19) 


where 


», = V Org = Nip + Nop)” + 4Nipg — ig + Np — Nip 


ee (20) 
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6, = N, - v{—Pa_) ) (21) 
_ 7 Drs 2 to 2 
a =, — N(—P2-) + (2), (22) 


and 7. is the solution between 0 and 1 of the cubic 
prs + (Nop — p — Nip + dq — Mp)rs 
+ (—d\i¢g + Nip — Nig — Nop — Oro + Nig = 0. (23) 


This approximation is due to D. L. Jagerman (see the derivation in the 





Appendix). 
For VN, = N, = NN, = KN, and N large, 
keg + 4kpq — I 
P,{A; | Ae} 21 —-— v d ae 4. (24) 


Table I shows some numerical values of the exact and approximate 
conditional probabilities along with the values of the approximations 
for large N. It is seen that, although P,{A, | A.} does not always surpass 
the simpler P,{A, | A2}, it is, on the whole, superior and its behavior 
with increasing WN is clearly better. We dwelled on P,{A, | A2} because 
it is useful in many cases and the mean and variance approach is easily 





TABLE I—NUMERICAL VALUES 





























% error % error 
in in 
p Ar = N | P{A1|A2}}Paf{Ai| Ac}|/P,{A1 | Ac}|PafAr | As}]Ps{A1 | As} 

0.1 5 0.3256 0.3246 0.3404 —0.3 4.5 
0.1 10 0.26087 | 0.26091 | 0.2732 0.015 4.7 
0.1 15 0.2296 0.2287 0.2402 —0.4 4.6 
0.1 20 0.2101 0.2096 0.2197 —0.2 4.5 
0.1 25 0.1966 0.1957 0.2052 —0.4 4.4 
0.1 co 0.1 0.092 : 
0.5 5 0.5017 0.4926 0.5125 —1.8 2.15 
0.5 10 0.4579 0.4635 0.4645 1.2 1.4 
0.5 15 0.4386 0.4587 0.4431 4.6 1.0 
0.5 20 0.4274 0.4599 0.4307 7.6 0.8 
0.5 25 0.4200 0.4621 0.4226 10.0 0.6 
0.5 <0 0.5 0.382 
0.9 5 0.7806 0.8161 0.7827 4.5 0.25 
0.9 10 0.7521 0.8258 0.7535 9.8 0.19 
0.9 15 0.7416 0.8412 0.7421 13.4 0.11 
0.9 20 0.7360 0.8525 0.7366 15.8 0.07 
0.9 25 0.7326 0.8604 0.7330 17.4 0.05 
0.9 oo 0.9 0.718 
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extendable to cases where extreme accuracy is not required (as in 
Section III). It can be easily seen that a percentage error in the con- 
ditional probability typically causes a smaller percent error in the 
point-to-point blocking probability. 


III. MORE COMPLICATED DEPENDENCES 


The extension of the basic idea of Section IT is often rather straight- 
forward. For example, consider the network in Fig. 2 which is com- 
plicated enough to illustrate the ingredients of a general approach. 

Assume (for the sake of being concrete) that VN, 2 N3; 2 Nz». All of 
the traffics are mutually independent Poisson. Let A;, A2, A; represent 
the events of links, 1, 2, 3 being blocked, respectively. Then the fol- 
lowing are approximations to conditional probabilities: 


= Np Wee 
> ( Padi (N= @! 





A, | A.} & = , aa? (25) 
. 2; @ ‘pial a 
i=0 fro 9} 
Ns N. ing > t 
> ( Y pial! a! =a! 
P{A; | A>} > oe a (26) 
a @ “pioals an 
i=0 0 7: 
Ns N ; mn 1 Ni-i 
aa or yn AT 
P{A, | As} = Ns Nant ra (27) 
2 Na-t = Oy! 
Di3Qis 7 
t= > 2 0 
where 
ee eee) 
=f a DN CAC TL AL DN me 9 
Pie N42 + Mia(1 7: Ps) -+- Ne + Noa (1 a P;) } Qi2 Pie } ( 8) 


Ble an Sone —-—— > 
ee eee jc ee 
Bosse ————— > 
Ny N2 N3 
—_—_——_—_——— SO OO Oe OD 
vA x 
outing ona i Ss > 


Fig. 2—Three-link network. 
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do3 + A3(1 = P,) 
ie re ee se 
P32 Nos + Ais(l — Py) + de + Ae(1 — P,)’ G32 Pao , (29) 


= Aua(1 =a P.) ne _ 
Ps = X00 — P,) + Fall — Bd)’ ae 
x =)i + me i Plas [AslP.) ’ (31) 


and the P; are approximations to the link blocking probabilities. In 
(31), the approximation to P{A;|A.} is used (and we are obviously 
assuming P; > 0). 

The rationale behind (25)—(27) is based on (6) at least in the case 
where the P; are small so that the carried traffics are not too far from 
Poisson. We have already discussed approximating the sums of (25)- 
(27) in Section IT. 

But for the network on Fig. 2 we should also calculate P{A, U 
A, U As}, the blocking probability seen by the ),; traffic. Since 


P{A,U A, U Aj} 
= P{A,} + P{A.} + P{A3} 
— P{A, | As}P{Az} — P{A, | As}P{ Aa} 
— P{A; | As}P{ 4s} + P{A, 0 As | As}P{42}, (32) 


we see that we need only discuss the evaluation of P{A; ™ A; | A3}, 
the other quantities already having been treated. 
Just as we derived (6), we can show that (exactly) 


P{A, CA; | Ay} 


N | pi3"p93°Di3 De: pee i yiactian sis 
2° 12 2 2 3 1 
_ isetiosttistis=Ne tie! tog! tig! te! (Ns — tos — tis)! (Ni — tie — tis)! 


. . yi ey a ee a ee F 
N.2! pi2"p23'Pi3 Do" a—~t23—413 As. 1 tig 7t13 Mi 





digtiostiisti2=Ne 419! tog! t13! 22! is=0 13! i1=0 11! 
(33) 
where now 
Are 
i2 > Pe 34 
Pie he Fas Fe Fe? ey 
dos 


Pes = Nace hae + he Pe my 
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These sums may be approximated by recognizing that they are both 
expectations of functions of multinominal random variables. Thus, 


BGs 3 te 5) Gs ks Te) 
+ 4{[E(éss — Ys)” 823 + Elis — ths)” dis + Elise — ts)? 82s] 
+ 2E[(i2s — tes)(tis — Tis) ]4623H61 
+ 2E[(i2s — tos) (tre — Tr2)]4823K612 
+ 2E[(iss — Us) (tr2 — Fiz) wOrsb12)}f(des , Tas 5 Tae) (38) 


where 7,» here indicates expected value and where 6” is a second central 
difference, e.g., 


deaf (os > t13 4 712) = f Ges + 1% Vs , 712) = 2f (ies » tis , 712) 


“TF f(Zes = 1, tis; T12) (39) 


(36) 


(37) 


and the operator 6 is given, e.g., by 
Mish (Ts y Tis, U2) = Af (das »%3 + 1, te) — fies, tis — 1,%12)] (40) 
so that, e.g., 
523813) (G23 y Ta y Tie) 
= 2[fls + 1, tis + 1, tie) + flrs — 1, tis — 1, hie) 
— f(s — 1, tis +1, tie) — fs +1, tis — 1, ti2)]. (41) 


The variances and covariances for the multinominal distribution are 
given on page 164 of Ref. 6. Thus, 


Vik = Nopjx ) jk = 23, 13, 12, (42) 
oi = E(t, — tx)” = Nopa(l — pi), jk = 23, 18, 12, (43) 

Tje,im = Ellis, — Ux) im — tim)] = —Nopixpin ; 
jk, lm = 23, 13, 12(jk x Im). (44) 


One obtains 
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1 A, ~{ 95 508% as = = 





1. tet = AsAi )43 1 2 (Ms adh Ar ) 
pe Xs ao (V3 +1(N,+ 1 T 9% i ak Nel 
1 2 (Mas, ae 1) N, N3dy ) 

Paar 023 — ft aaa mca 2 es, IN ALR, Shae aha keene VEN, OL ee 
ee EB dz MA - (Ns + 2)(N3 + 1)(N. + D 
2 (NaN. _ Nadi ) 
os oi. hadi ae (Nab) 1) 


2 (NaNi(Ni — 1) _ AsAt \/ 
7 mal oe aa (N3 + 1)(N, + 2)(V, + 1) 


{Bs'By’ + 4023(B3(— 1) Br’ — 2B;"Br* + Bz'(1)By') 
+ 301;(Bs'(—1)By'(—1) — 2B;'By' + Bs"(1)Br'(1)) 
+ 407,(B;'Bi'(—1) — 2B;'By' + B;'By'(1) 
+ $lo%s,13(Bs'(—2)By'(—1) — 2B3'By’ + B;*(2)By'(1)) 
+ 0%8,12(Bs'(—1)Br'(—1) — 2Bs'Br* + Bs’(1)Br'(1)) 








+ o%8,12(Bs '(— 1) Br'(—2) — 2B5'By* + Bs*(1)Br'(2))]} (45) 
where 
N; = Nz — t23 — Lis , (46) 
N, = Ni — te — U3, (47) 
and for 7 = 3 or 1, 
B;* = 1/BWN; Nai (48) 
B;*(-1) = B;* — 1, (49) 
pie pe Si Me 7 (50) 
By (ly = By? + = (51) 
7 7 + 1 Py 
BP OyBe fs So sas ae (52) 


N; a (N; + 2); + 1) 


In many cases, P{ A; U A, U Az} may be sufficiently well approxi- 
mated by using B(N; , \3)B(N, , »x) for P{A, (\ A; | Az} since the 
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last term in (32) is typically smaller than the preceding three which 
are, in turn, typically smaller than the first three. 


IV. DISCUSSION 


We tried the method out in a number of examples and found the 
dependence analyses to improve accuracy considerably when dependence 
is indeed important and to be useful in indicating when the dependence 
effect can be safely ignored. This investigation was actually motivated 
by a study of the use of a satellite to carry telephone traffic. (This study 
will be described more fully elsewhere.) One mode of such use, called 
‘variable destination,” allocates up-channels (each channel consists 
of a number of trunks) from ground stations to the satellite while all 
ground stations can receive all that is transmitted downward from 
the satellite. An equivalent network for five ground stations (the outer 
- nodes) is shown in Fig. 3. Between each pair of ground stations there 
is an offered traffic; thus, there are 10 (2-way) traffies. For various 
choices of the traffics and allocations, the dependence effect becomes 
significant and it was important to know when it needed to be taken 
into account and when it could safely be ignored (which leads to sim- 
plification in the study, particularly the optimization algorithms—an 
aspect of the study was to optimize the channel allocations to minimize 
the traffic blocked). The method of this paper was found quite useful 
in this regard. It should be noted that, although the N; are very large 
and results of the form of (1) lead to huge sums, the method was com- 
putationally simple. 

The approach to dependence given here is one of approximations 
motivated by exact results in simple contexts. The results obtained 
thus far are promising and encourage further work in investigating 
the approximations and in extending the approach. Further experience 
is needed to determine the best way to apply the results to large net- 
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Fig. 3—Equivalent network for satellite variable destination mode. 
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works (see the last comment in Section III). Observe that improved 
accuracy (and, possibly, satisfactory accuracy from a practical view- 
point) in large networks can be achieved by considering only two-link 
or only two- and three-link dependences (already discussed in this paper). 
An inherent assumption that has been used is that the traffics are 
Poisson (or close to Poisson). As mentioned in Section I, the analysis of 
point-point blocking probabilities in conjunction with the use of the 
equivalent random method will be reported in Ref. 1. 
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APPENDIX 


In Section II, two approximations for P{A,|A.} were given. This 
Appendix derives the sum approximations for P,{A,| A2}. In particular, 
approximations for the following sums will be obtained:* 


“ (n ton-i Mt ue : ‘ 
oo > (")p%a =a al ahi sil 63) 
n ; N-iyi N-& 
n= Ee Sh =a EI 4 


The random variable é is distributed according to the binomial law. 
One has 


x 1 —N+i-1 hz 
Wp mes e” dz, (55) 
N-i rn? 1 gets 2 
20 tt = Oni aa ‘Tay (56) 


in which the integration is on a circle around the origin as center. 


Let z = re’, then 


yaa 7 ay . erretP—(N-1) 40, (N-1) gig 

(N — j)! 9 Qn (57) 
N-i\l 7 : RT a La) 
= S5 x : grreth-(N-1) 40 7a i dé. (58) 


* The derivations to follow are due to D. L. Jagerman who prepared this Appendix. 
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Computing the required expectations from (57) and (58), one obtains 


fos f ge eg 4 nade: (59) 
2a J_, 
= a "pei 8-NiO—-N ae( OF —7In ?) 
Jy = oe eer dé (60) 
in which the characteristic function ¢(7) is given by 
o(r) = Ele"*] = (q + pe’y’. (61) 
Thus 
qe il * gh ag (62) 
' IS oy : 
see ig eh ag (63) 
a) ae : 
in which 
h,(0) = re’ — Nio— Nlnr+nIn(q + pre’), (64) 


h2(0) = dre’ — Nie — N Inr + nln (q + pre’’) — In (1 — re’). (65) 
The expansions of h,(@) and h.(@) in powers of @ are 


hi(0) = 9 — NInr +nIn (q+ pr) 


7 _mpr 
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“ifeease eet 
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h.(0) = 9 — N Inr+nI1n(q+ pr) — Ind —- 7) 








npr 


+ (w- w+ 4 io 
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(67) 


. Using the method of Hayman (see Ref. 7) the functions h,(6), h2(@) 
will be made stationary at 6 = 0 by defining 7, , r, as follows: 
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7 Nps te 
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Thus the values of r satisfy 
pri + (Aq — Np + np)r, — Nq = 0, (70) 
prs -+ (np — p — Np + dq — dp)r2 
+ (—r\q + Np — Nq — np — Qr2 + Nq = 0. (71) 
For r, , the following explicit formula is obtained: 


r= Va — Np + np) ee Mgt Np OP 9) 


For r, , the root between zero and one must be used. Let 














2 npr ( pri ) 
By Ary + q+ pn n q+ pr’ (73) 
» pro (ps rs ( rs ) 
Bo NE eae Learaers sa Re =a (74) 
then 
h,(0) = 1 — N Int, +n In (q¢ + pr) — 36.6 + °°: , (75) 
h.(0) = 2 —-N Inrz +n In (q+ pr.) — Nn (1 — m) — $82.67 + °°: 
(76) 
Since 
Lf" _1¢92 1 
20 bee V/ 2nB ( ) 


in which the approximation becomes the more accurate the larger 2 is, 
one now obtains 
en 


J, = ri"(q + pr)” Vind, : (78) 





Are 


J, = af PP -_—___- 79 
r (q + pre) (1 — r.) V 2m, ( ) 
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A Laboratory System for Measuring 
Loudness Loss of Telephone Connections 


By J. L. SULLIVAN 
(Manuscript received August 5, 1970) 


Transmission performance of the telephone message network has im- 
proved steadily over the years. Coincidental with this improvement has 
been the evolution of rating plans which provide the basis for transmission 
planning, design, and evaluation. Loudness of telephone speech was an 
important consideration in this evolution. 

Telephone speech loudness continues to be one of the major factors which 
need to be taken into account in telephone transmission engineering. This 
paper covers a laboratory system, called EARS (EHlectro-Acoustic Rating 
System), devised to make objective measurements of partial and overall 
telephone connections (including electro-acoustic transducer efficiencies) in a 
manner which reflects subjective loudness loss. Topics covered include a 
historical review of rating plans, computation of speech loudness, evolution 
of the EARS, and description of the system and tts capabilities. 

The EARS essentially comprises a sound source and a meter for meas- 
uring either acoustical pressure or electrical voltage. Design of the source 
and meter are based on telephone speech loudness considerations, and 
measurements made with the system approximate subjective loudness 
judgments with an accuracy which 1s sufficient for telephone engineering 
pur poses. 

The EARS may be used in implementing any telephone transmission 
rating plan incorporating speech loudness loss as an element. In such 
application, the system can be used for specifying connection losses, thus 
eliminating the need for extensive subjective tests of loudness. However, 
subjective tests will still be required to evaluate effects of other elements, 
€.g., noise, important in any given plan, and to evaluate the range of loud- 
ness losses acceptable for that plan. 


I. INTRODUCTION 


A basic Bell System objective—to provide our customers with the 
best possible telephone message transmission consistent with the 
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state of the art and the economic climate—has remained essentially 
unchanged since the early days of telephony. However, continuing 
review of the telephone system in terms of this objective has resulted 
in steady improvement in transmission performance of the system 
over the years. Such improvement has been made possible by growth 
in our technical skills; it has been made necessary by evolving 
customer needs for improved transmission. Indeed, it has been postu- 
lated that as our customers use the telephone, they become ac- 
customed to current performance and come to expect further im- 
provement? 

What do we mean by telephone message transmission performance? 
In the broadest sense, this refers to the effect of the system on speech 
signals when these signals are transmitted over telephone connections. 
Customers conversing over telephone connections want to hear reason- 
ably faithful, undistorted reproductions of each others’ voices with 
a minimum of effort. Connections for which these conditions pertain 
can be thought of as providing satisfactory transmission performance. 
Connections exhibiting severe distortion would thus provide some- 
thing less than satisfactory performance; customers might be able to 
converse but only with extreme difficulty. 

Speech transmission capabilities of the telephone network are often 
considered in terms of individual transmission parameters, the com- 
bination of which determines overall transmission performance. Some 
of the more important parameters are loss, amplitude distortion, and 
unwanted interferences such as noise, crosstalk, and echo. Improve- 
ment in performance over the years has been achieved by design to 
control these parameters, singly and in combination, as dictated by 
the technology, economics, and needs of the times. 

Coincidental with this improvement in transmission performance 
has been the evolution of telephone transmission rating. The problem 
of rating, that is evaluation and measurement, has been the subject of 
much thought and work over many years, and a number of different 
rating plans evolved to meet the transmission design needs for an 
improving and expanding telephone message network. 

For present purposes, a transmission rating plan comprises (t) a 
rating criterion which represents the basis for rating telephone con- 
nections, (ii) a reference system (may be a physical simulation of an 
overall telephone connection, a set of definitions, or both) with some 
adjustable feature in terms of which connection ratings are estab- 
lished based on the selected criterion, and (wi) a rating scale which 
is essentially represented by the variable feature of the reference 
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system. Thus, a rating plan provides a framework for the design and 
evaluation of telephone connections. 

Historically, the rating criterion was subjective in nature, and ex- 
tensive subjective testing was required to evaluate telephone connec- 
tions in terms of the reference. Moreover, application of the various 
rating plans required subjective testing to evaluate scale values, i.e., 
determine what scale values (and connections) represented acceptable 
performance. 

Transmission performance of the telephone message network has 
improved to a point that loudness loss is a major variable which has 
to be taken into account in transmission planning. This paper de- 
scribes a laboratory rating system, called the EARS (Electro-Acoustic 
Rating System), which can be used to objectively measure loudness 
loss of telephone connections in a manner which closely approximates 
subjective loudness judgments. Thus, the system supplants subjective 
tests which would ordinarily be required to determine loudness ratings 
of connections. However, tests will still be needed both to determine 
subjective reaction to various amounts of loudness loss, and to deter- 
mine interrelationships between loudness loss and other transmission 
parameters important in any given rating plan. 

The EARS was devised to measure acoustic pressures and electric 
voltage as required by loudness rating definitions for partial and 
overall telephone connections.? The EARS has been used extensively 
over the past several years to characterize the loudness performance 
of telephone sets and connections, and to evaluate design plans such 
as unigauge design of the customer loop plant.* Moreover, loudness 
loss as measured with the EARS is an important element in current 
studies of transmission planning based on a multiparameter approach 
in which other transmission factors, e.g., noise and echo, are included 
with loudness loss.> Also, the EARS concept is one of several candi- 
dates for adoption as a standard method of specifying loudness loss.* 

The EARS essentially comprises a sound source which is used to 
energize the talking end of a telephone connection and an indicating 
meter which is used to measure acoustic pressure or electrical voltage, 
and provides a simple means of measuring input and output signal 
levels for partial and overall telephone connections. Loudness losses 


* Methods of measuring the loudness loss of telephone connections are cur- 
rently being studied by (2) Study Group XII of the CCITT (Comite Con- 
sultatif International Telegraphique et Telephonique—International Telegraph 
and Telephone Consultative Committee) as outlined in Question 15/XII of Ref. 
6 and (2) the Task Force on Telephone Instrument Testing of the Institute of 
Electrical and Electronics Engineers. 
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of connections and connection components, including electro-acoustic 
transducers, i.e., telephone transmitters and receivers, are then the 
differences between input and output signal levels expressed in dB-like 
terms relative to appropriate reference signal levels. 

The approach followed in this paper is first to consider the history 
of transmission rating plans and, second, to discuss the evolution of 
the EARS. Discussion of rating plans in Section II demonstrates the 
interrelationships of rating plans, rating systems, and network trans- 
mission performance improvements. Concluding remarks outline the 
relation between loudness rating definitions and the rating system, 
the EARS, which is used to determine parameter values as required 
by the rating definitions. Also mentioned are current studies con- 
cerned with multiparameter network design and evaluation. 

Discussion of the evolution of the EARS involves several steps. We 
begin in Section III by addressing ourselves to a review of various 
techniques for computing the loudness of tones, noise, and speech, then 
discuss in some detail the derivation of a particular speech loudness 
computation method. This discussion indicates the manner in which 
frequency response characteristics of partial and overall connections 
can be measured and shows how, from the measured response for a 
given connection, we can compute a number which we will call loud- 
ness. Also included is a comparison of computed and experimental 
results. 

Section IV covers derivation of the EARS from the speech loudness 
computation method referred to above and describes the EARS in its 
present form. Also described is a graphical computation method based 
on the design concepts leading to the EARS. 

In Section V, we discuss the accuracy of the EARS in predicting 
subjective test results and consider the effects of simplifying assump- 
tions employed in deriving the EARS. The vehicle for this is the 
graphical method referred to above. Results computed using this 
method are compared to observed results for the subjective tests used 
in validating the speech loudness computation method. This approach 
is necessary since the subjective test systems are no longer available, 
and hence could not be measured directly using the EARS. Therefore, 
the comparison of computed and observed results reflects the accuracy 
of the concepts on which the EARS is based, and not of the EARS 
itself. 

As will become evident, the EARS in its present form is only one 
of several ways in which the computational method could be imple- 
mented as a laboratory measuring system, and is not an exact realiza- 
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tion of the computational method. This form was selected because it 
provides a satisfactory combination of simple equipment arrange- 
ments and suitable calibration and measurement procedures. 


II. EVOLUTION OF TRANSMISSION RATING PLANS 


Historically, transmission rating plans depended on (2) the use of 
reference circuits with which commercial circuits could be compared 
and (iz) systems of units for expressing the relative ratings thus 
determined. A great deal of subjective testing was necessary to deter- 
mine ratings of commercial telephone equipments and facilities, and 
to correlate these ratings with measurable transmission properties. 
These ratings, in terms of the appropriate prevalent unit for express- 
ing them, were used in design and evaluation of the telephone message 
network. 

Considerable work was also necessary to determine transmission 
objectives for the network. That is, the reference system and related 
units provided the means of expressing transmission performance; it 
was then necessary to determine the rating, or range of ratings, to 
which the plant should be designed. 


2.1 Transmission Equivalent Plan 


For a time following the turn of the century, telephone circuits in 
commercial use had quite similar characteristics, and conditions were 
such that essentially only loudness, or volume, capability was under 
control of the transmission engineer. The performance of circuits was 
determined by comparing them, on a loudness basis, with a reference 
circuit which was adjustable in attenuation, but whose other trans- 
mission characteristics were typical of commercial circuits. 

The reference circuit in use at that time was known as the Standard 
Cable Reference System (SCRS).7-° It consisted of telephone sets 
(including transmitters and receivers), cord circuits (for supplying the 
telephone set carbon transmitters with operating current), and an 
adjustable artificial line for interconnecting the cord circuits. These 
connection components were representative of types then used com- 
mercially. 

The rating of telephone circuits by comparing them to the reference 
circuit involved a talker speaking alternately over the test circuit and 
the reference circuit, and a listener switching similarly at the receiving 
ends. The reference circuit artificial line, calibrated in terms of “miles 
of standard cable,” was adjusted until the listener judged the volume 
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or loudness of the speech sounds reproduced by the two circuits to be 
equal. The number of miles of artificial line in the reference circuit 
was then used as the “transmission equivalent” of the circuit under 
test.? The effect of any change in a test circuit on the efficiency of 
that circuit could then be measured by determining the “transmission 
equivalent” before and after the change; the number of miles of 
artificial line required to compensate for the change was used as an 
index of this effect. 

Performance ratings assigned in terms of loss of speech volume or 
loudness based on the comparison procedure outlined above consti- 
tuted a practicable and effective means of assessing transmission per- 
formance. The adverse effects of sidetone, distortion, and noise on 
transmission quality were recognized, but no way was known of 
incorporating them, together with loudness loss, into a single figure 
of merit for rating transmission performance.* Moreover, the simi- 
larity between the reference circuit and commercial circuits at that 
time rendered such incorporation unnecessary. 

As the state of the telephone art developed, modifications in the 
reference circuit became desirable in order for the circuit to more 
satisfactorily fulfill its purpose. Three factors leading to the modifi- 
cations were (7) improved designs of telephone instruments and cir- 
cuits, (7) improved measuring techniques and instruments, and (2) 
the need for a more suitable unit than the “miles of standard cable.” 

The effect of improved design was to introduce into the plant 
telephone instruments and circuits which had less distortion than 
corresponding parts of the Standard Cable Reference System. For 
this reason, it became desirable to have a new reference system with 
which transmission over the most perfect telephone circuit, or over 
circuits less perfect, could be simulated at will.®® 

Secondly, the performance of the Standard Cable Reference System 
was specified by stating the kinds of apparatus and circuits used. The 
electrical portion of the system could be checked by voltage, current, 
and impedance measurements. However, for the transmitters and 
receivers, reliance for constancy of performance was placed primarily 
upon the careful maintenance and frequent cross comparisons (sub- 
jective) of a group of transmitters and receivers which were specially 
constructed to reduce some of the sources of variation in the regular 
product instruments.*® Improvements in measuring techniques and 
instruments made it possible to measure objectively characteristics 
of electro-acoustic transducers. This, in turn, permitted selection and 
maintenance of transducers to provide long-term stability. 
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Finally, a change in rating units became desirable for two reasons. 
First, circuits were being designed which had less distortion than the 
artificial cable of the Standard Cable Reference System. Since a mile 
of cable with this system corresponded not only to a certain volume 
change, but also to a distortion change, characterizing new circuits 
in terms of miles of standard cable implied a distortion degradation 
not necessarily attributable to the circuit under test. Secondly, two 
different reference systems were in common use, one in the United 
States, the other in some other countries.1° These systems used 
artificial cable with different characteristics. That used in the United 
States had a loop resistance of 88 ohms and a capacitance of 0.054 
microfarad per mile; the other used artificial cable having the same 
resistance and capacitance but had, in addition, an inductance of 1 
millihenry and a conductance of 1 micromho per mile. Thus, a test 
circuit compared with each of these references would have two dif- 
ferent ratings assigned to it. The first of the above reasons suggested 
the need for a distortionless unit, the second for a common unit. 

A new unit, called the Transmission Unit (TU), was devised by 
the Bell System.t°* This was a distortionless, logarithmic unit so 
chosen as to make use of common logarithms convenient in transmis- 
sion computations. Its magnitude was very nearly the same as the loss 
of a mile of standard cable and, thus, existing experience learned in 
terms of miles of standard cable could be transferred to the new 
system with a minimum of difficulty. The Transmission Unit later 
became the decibel (dB)."! 

The three factors discussed in preceding paragraphs resulted in 
design of the Master Reference System (MRS).*:° This system utilized 
transducers with very low distortion, amplifiers to compensate for the 
lower efficiency of these transducers as compared to commercial in- 
struments, and an artificial line (consisting essentially of a 600-ohm 
attenuator) for interconnecting the transmitting and receiving ele- 
ments. The system included provision for inserting distorting networks 
into the transmitting and receiving elements so that performance of 


* An intervening step between the “Mile of Standard Cable” and the “Trans- 
mission Unit” was the “800 Cycle Mile.” This also was a distortionless logarithmic 
unit, equal in magnitude to the loss of a mile of standard cable at 800 Hz 
(actually, the loss at 796 Hz was used). There were thus two “800 Cycle Mile” 
units because of the two different standard cable specifications. The “800 Cycle 
Mile” and its successor, the “Transmission Unit,” differed in that (2) the former 
represented a current ratio while the latter represented a power ratio and (it) 
although both were logarithmic, the former was in units of log 1.115 (one 800 
Cycle Mile corresponded to a current ratio of 1.115) while the latter was in units 
of 0.1 log 10 (one Transmission Unit corresponded to a power ratio of 10°-1). 
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commercial transducers, characterized by amplitude response curves 
with pronounced resonances, could be simulated. 

Transmission engineering was still done on a loudness basis, and 
the MRS was used to obtain ratings in the same manner as that 
previously described for the SCRS. That is, (2) speakers talked 
alternately over the reference and test systems, (27) observers listened 
over the systems, switching with the talkers, and (27) the line (600- 
ohm attenuator) of the MRS was adjusted to obtain equal loudness. 
The average setting of the attenuator (in dB) at balance was then 
the rating of the system under test. 

Adoption of the MRS by the Bell System had important long-term 
effects on international standardization of telephone reference sys- 
tems. In 1926, the CCI* invited representatives of the Bell System 
to meet with a committee appointed by the CCI to consider adoption 
of a transmission reference system.? At the recommendation of this 
committee, the CCI adopted the MRS. Two of these systems were 
built by the Bell System. One was retained at Bell Telephone Lab- 
oratories in New York, the other was sent to the laboratory of the 
CCI in Paris in 1928. The CCI System was designated the SFERT.* 

In the late 1950s, the SFERT was replaced by a new reference 
system, the NOSFER.? The NOSFER is so constructed and calibrated 
that it is essentially equivalent to the SFERT. Telephone connection 
ratings obtained with either the SFERT or the NOSFER are desig- 
nated as Reference Equivalents (RE) in dB, and are numerically 
equal to the setting (in dB) of the reference system line attenuator 
required for loudness balance. 


2.2 Effective Loss Plan 

In the early 1930s, the Bell System adopted a new transmission 
plan. The change became necessary because of technological advances. 
Telephone sets incorporating antisidetone circuitry and improved 
transducers began to be used in quantity in the Bell System. These 
sets had poorer loudness performance than their predecessors, but 


* CCI = Comité Consultatif International des Communications Telephoniques 
a Grande Distance (International Consultative Committee for Long Distance 
Telephone Communication). This organization later became the CCIF and is 
now the CCITT. 

+SFERT = Systeme Foudamental European de Reference pour la Trans- 
mission Telephonique (European Fundamental Reference System for Telephone 
Tratismesion): See Ref. 12. 

t+ NOSFER = Nouveau Systeme Foudamental pour la Determination des 
Equivalents de . Reference (New Fundamental System for the Determination of 
Reference Equivalents). See Ref. 6. 
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provided marked improvement in such characteristics as sidetone, 
amplitude distortion, and nonlinear distortion, all of which have a 
marked effect. on transmission. Evaluation of these telephone sets in 
terms of the old system tended to emphasize loudness differences at 
the expense of these other factors. Thus, there was need for a rating 
plan which properly recognized the improvements and, at the same 
time, retained applicability for the older telephone set types. The 
effective loss plan was devised to meet this need.1*14 

Ratings under the new plan were in terms of dB of “effective” loss 
to distinguish them from “loudness” or “volume” losses of the old 
plan. Effective loss represented a figure of merit for evaluating the 
effectiveness of the transmission over telephone circuits and, as such, 
was a measure of the ability of telephone listeners to understand as 
well as to hear transmitted telephone speech. 

Effective transmission data were determined in terms of the Work- 
ing Reference System (WRS).1* This system consisted of representa- 
tive customer loops and telephone sets and a variable, distortionless 
trunk, i.e., an attenuator. Ratings for commercial connections were 
obtained in effect by comparing these, using live talkers and listeners, 
with the reference system, adjusting the distortionless trunk until the 
reference system and test connection were judged to provide equivalent 
transmission. (Basic data for the plan were. obtained from two-way 
conversation tests, in contrast to the earlier plan for which tests 
were on a listening-only basis.) The criterion for such equality was 
repetition rate, that is, the rate of occurrence of repetitions requested 
by test subjects.1*?4 Thus, balance was achieved when the WRS (with 
a particular trunk setting) and the test connection provided the same 
repetition rate. The WRS trunk setting, in dB, was then a measure 
of the effectiveness of the circuit under test. The reference trunk 
setting for the Working Reference System was selected to provide 
numerical equivalence of ratings obtained with the new plan and its 
predecessor. With reference trunk setting, the Working Reference 
System had an effective loss of 18 dB which was also its transmission 
equivalent in terms of the earlier plan. 

During the early 1930s when the effective loss plan was under 
development, in-plant telephone set transducers were predominantly 
of the resonant type, characterized by large amounts of amplitude 
and nonlinear distortion, telephone sets were largely of the sidetone 
variety, and a number of low cutoff cable loading systems were in 
use. However, improved transducers and antisidetone sets were being 
introduced into the plant, and additional improvements were under 
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development. (The transitional nature of the telephone set plant 
at that time and additional planned improvements were important 
considerations leading to adoption of the effective loss plan.) This 
resulted in introduction of the 300-type telephone set into the plant 
in 1937.57 Subsequent study similarly resulted in introduction of the 
500-type telephone set into plant in 1950.1%° The effective loss plan 
encouraged these improvements and, also, the adoption of higher 
cutoff cable loading systems. - 

By the mid 1950s, telephone sets provided sufficient antisidetone 
and freedom from amplitude and nonlinear distortion that little could 
be gained by further improvements. Thus, loudness appeared to be 
once again a major variable factor in plant design, likely to be the 
most important consideration in subsequent telephone set design. 

The uniformly high level of transmission performance characteriz- 
ing the plant suggested revision of telephone transmission planning 
to reflect more emphasis of loudness effects. Impetus for such revision 
was provided by (2) the availability of a speech loudness computation 
technique and a laboratory measuring system based on this technique 
and (2) the need for planning in simpler terms than was possible 
with the effective loss plan which suffered from a number of practical 
disadvantages.* 


2.3 Loudness Rating Plan® 


For present purposes, the loudness rating plan comprises definitions 
for loudness ratings of partial and overall telephone connections.* 
Loudness is a subjective quantity and, therefore, loudness rating is 
defined as the ratio of the loudness of the speech into the listener’s 
ear to the loudness of the speech out of the talker’s mouth. As used 
here, however, loudness rating is considered to be determined by 
objective measurements. 

The rating definitions involve both acoustic pressures and electric 
voltages. The EARS provides a means of measuring these quantities. 
The EARS consists of a source of acoustic energy, comprising a com- 
plex voice-frequency test tone which simulates certain properties of 
human speech and an artificial mouth, and an indicating meter for 
measuring voltages or sound pressures in a manner which simulates 
the loudness perception of an average listener. 


Ratings obtained using the EARS do not precisely equal ratings 
* Loudness ratings as specified by the definitions are expressed in decibels. 


Such usage does not strictly conform to the definition of the decibel, but this 
should have no effect on general use of the term “decibel.” 
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obtained using human beings, in part because the artificial mouth and 
the 6-em* coupler do not precisely duplicate the characteristics of their 
human counterparts. However, EARS ratings are sufficiently accurate 
to be highly useful in telephone transmission engineering although some 
problems associated with using the EARS to measure nonlinear trans- 
ducers, e.g., carbon microphones, remain to be solved. Continuing 
research aimed at characterizing human mouths and ears may result in 
artificial counterparts the use of which in the EARS will render the 
differentiation between EARS and subjectively determined loudness 
ratings unnecessary. 

The EARS is thus an instrument used to measure the loudness 
performance of telephone instruments and facilities in much the same 
manner that the 3A Noise Measuring Set is used to measure the 
subjective magnitude of telephone message circuit noise.?? Loudness 
performance is expressed as constants and/or families of curves which 
are used in transmission planning. Examples of such use are the 
unigauge plan already cited* and current studies which are con- 
templating transmission design based on (7) noise and loss and on (it) 
noise, loss, and echo.® 

Earlier rating plans utilized physical reference systems which 
simulated overall telephone connections. Loudness rating as considered 
here is based on measurement of physical quantities, and does not 
require specification of a standard simulated connection. 


III. COMPUTATION OF LOUDNESS OF SPEECH 


A number of tests have been conducted to determine the loudness 
of telephone speech signals. Study of these test results, and of various 
methods devised for computing the loudness of speech and tones, 
resulted in a particular speech loudness computational procedure. This 
procedure, developed some time ago but not previously reported in 
the literature, is discussed in some detail in the present section. 

We begin with a summary of the essential attributes of the pro- 
cedure with appropriate reference to functions required in the com- 
putation of loudness ratings. Following this, several methods for 
computing the loudness of continuous spectrum sounds are referred 
to, and the background for the specific computation method of con- 
cern herein is reviewed. We then derive the functions required for 
the computational procedure. Finally, the procedure is used to com- 
pute the loudness performance of several laboratory test systems, and 
the computed values are compared to experimental results. 
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The computational method is not based on fundamental properties 
of speech sounds and of the hearing mechanism and, therefore, is 
probably not generally applicable in determining the loudness of 
other types of sounds. Rather, the method is based on a “black box” 
approach in which a defined “speech” signal is applied at the input, 
and the signal so processed that the output of the black box cor- 
relates with subjective experience of speech loudness. Internal opera- 
tion of the black box is specified, but such is not intended to reflect 
exactly the specific manner in which the human being processes speech 
stimuli to obtain loudness although similarities are noted. This ap- 
proach is used because, although there are a number of theories 
covering loudness perception, the specific operation of the hearing 
mechanism in determining loudness is not known. 


3.1 Description of Speech Loudness Computational Method 

The method, depicted on Fig. 1, is based on performing certain 
operations on the pressure spectrum delivered by a telephone con- 
nection to the ear of a listener. In essence, these operations comprise 
(z) dividing the received speech spectrum into a number of different 
frequency bands, (i) determining the loudness due to each band, 
and (222) summing across all bands to obtain the total loudness. 

The received pressure spectrum consists of (7) a reference speech 
spectrum applied at the transmitting end of a connection modified 
by (i) the amplitude transfer characteristic of the connection. The 
compromise spectrum and system response definitions are given on 
Figs. 2 and 3 respectively.* 

Not all of the received spectrum contributes to loudness, i.e., that 
portion of the spectrum lower in level than the threshold of hearing 
does not contribute. Account is taken of this by defining a quantity 
termed effective spectrum which is the received spectrum minus X, 
the threshold of audibility for continuous spectrum sounds. The X 
function is shown on Fig. 4. 

The effective spectrum is divided into 50 frequency bands selected 
such that each contributes equally to the total loudness produced by 
a flat effective spectrum. The frequency limits for the “2 percent” 
loudness bands can be derived from the function of Fig. 5. 

The effective level in each band is then converted to loudness, in 
loudness units (LU), using the function of Fig. 6, and the loudness 


* The ordinate of Fig. 2 is in terms of dBt. For purposes of this paper, dBt 
= dB relative to 2 x 105 newton/meter?; 0 dBt = 20 logw(2 « 10-5). 


ELECTRO-ACOUSTIC RATING SYSTEM 2675 


BANDPASS 
FILTERS CONVERTER 


























TOTAL 





















= LOUDNESS, 
COMMUNICATION EFFECTI N 
REFERENCE s 
SPEECH SYSTEM SPEECH (LQUONESS 
FREQUENCY UNITS) 
SPECTRUM 
Ls (aBt) RESPONSE, R " 
(dB), += LOSS 


=n; 
i=1 








THRESHOLD OF 
AUDIBILITY FOR 
CONTINUOUS SPECTRUM 
SOUNDS, X (dBt) 


NOTE: Lg, R, X AND Zg ARE 
FUNCTIONS OF FREQUENCY. 


Fig. 1—Computation of speech loudness. 


units summed for the 50 bands. The resulting sum, N,, represents 
the total loudness contained in the particular received spectrum. 

Once the total loudness, N,, is obtained, the problem of its inter- 
pretation arises. One useful way of expressing the loudness is in terms 
of loudness level.* Functions which may be used to convert N, to 
loudness level are shown on Fig. 7. 

A more significant way of expressing loudness for speech com- 
munication problems is in terms of the level of a reference speech 
spectrum. The function relating total loudness to the level of the 
selected reference speech spectrum is given on Fig. 8. 


3.2 Survey of Loudness Studies 


Loudness has been studied extensively as demonstrated by the 
numerous publications relating to this subject. Studies reported have 
been largely concerned with single-frequency tones, tone complexes, 
and continuous spectrum steady sounds. Several different methods 
for computing the loudness of these different sounds have evolved.* 
Some of the methods, specifically those covering continuous spectrum 


* The loudness level of a sound, in phons, is numerically equal to the median 
sound pressure level, in dBt, of a free progressive wave of frequency 1000 Hz 
presented to listeners facing the source, which in a number of trials is judged by 
the listeners to be equally loud. The unit of loudness is the sone. By definition, 
a 1000-Hz tone 40 dB above a listener’s threshold produces a loudness of 1 sone; 
the loudness of any sound that is judged by the listener to be n times that of 
the 1-sone tone is n sones.21 In this paper, we will use the term loudness units 
(LU) when considering loudness of speech, and the term sones in discussing other 
ou However, speech of N LU is equal in loudness to any other sound of 

sones. 

+ References 22 through 27 describe some of these computational methods. 
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SOUND PRESSURE SPECTRUM LEVEL IN dBt- 
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Fig. 2—Sound pressure spectrum. of continuous speech at 2 inches from the lips 
of a talker. 


steady sounds, are sufficiently general that with appropriate specifica- 
tion of the speech signal they may be suitable for computing the 
loudness of speech.* Such suitability has not, to the author’s knowl- 
edge, been demonstrated and these methods are not considered in this 
report. 

The study of speech loudness has received somewhat less attention 
than the general subject of loudness. Publications covering the study 
of speech loudness are limited in number, and of these, only a few 
consider speech loudness computational methods.t 

In 1924, H. Fletcher and J. C. Steinberg devised a method for 
computing the loudness loss, due to changing the transmission system 
response, of a sound being transmitted to the ear.2® Results obtained 
using the method were in close agreement with experimental results 
for the specific sounds—speech and a complex test tone—considered 
in the study. 


* See, for example, Ref. 26. 

+ References 28, 30, 31, 32, and 33 report results of some speech loudness tests. 
etnias 28, 29, and 31 describe speech loudness computation methods de- 
veloped. 
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In 1925, Steinberg developed a more general method of computing 
the loudness of any complex sound.?® (The method referred to in the 
preceding paragraph was a special case.) Results computed using this 
method agreed with experimental results then available, including the 
data reported in the 1924 paper. 

In 1938, W. A. Munson developed a method for computing the 


(A) ORTHOTELEPHONIC RESPONSE DEFINITIONS 


TRANSMITTING LINE RECEIVING 
RESPONSE = RESPONSE = RESPONSE = 


20 LOG V1/P \ 20 LOG V2/V1 20 LOG P2/V2 





P; = PRESSURE AT 2 IN. IN FRONT OF REAL VOICE, TRANSMITTING ELEMENT ABSENT. 
Vi, = OPEN CIRCUIT VOLTAGE OF TRANSMITTING ELEMENT,:WHEN SPOKEN INTO BY A REAL VOICE. 


Po = PRESSURE IN FREE FIELD AT CENTER OF LINE BETWEEN OBSERVER’S RIGHT AND LEFT EARS, 
OBSERVER ABSENT. 


V2= VOLTAGE ACROSS RECEIVING ELEMENT WHEN SOUND FROM RECEIVING ELEMENT IS AS LOUD 
AS THE SOUND FROM THE FIELD. 


NOTE: PRESSURE AND VOLTAGE REFERENCE LEVELS MUST BE STATED. 


(B) ORTHOTELEPHONIC SYSTEM 








ORTHOTELEPHONIC TRANSMITTING ELEMENT, RESPONSE = K 
a RECEIVING ELEMENT, RESPONSE = —K 


7 LINE, RESPONSE = —25 


AS DEFINED 
IN SECTION A 





OVERALL, OOS = —25 
NOTE: K = A CONSTANT AS A FUNCTION OF FREQUENCY. LINE RESPONSE REFLECTS 
DIFFERENCE BETWEEEN THE POSITION AT WHICH PRESSURE, P1,IS MEASURED AND 
100 cm POSITION OF LISTENER’S HEAD PER SECTION C. 
(C) ACOUSTIC PATH SIMULATED BY ORTHOTELEPHONIC SYSTEM 


TALKER LISTENER 


NON-REVERBERENT, 
QUIET ROOM ONE EAR SEALED 






Tot = ORTHOTELEPHONIC RESPONSE OF GIVEN TRANSMITTING 






ELEMENT 
7 AS DEFINED 
Rot = Deuces one RESPONSE OF GIVEN RECEIVING IN SECTION A 





RESPONSE OF GIVEN LINE 





OVERALL; OTS = Tot + Rot +L 


DEPARTURE FROM ORTHOTELEPHONIC SYSTEM, D=OTS—OOS= Tot+Rot+L+ 3 


Fig. 3—Orthotelephonic transmission. 
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X IN dBt 


NOTE: 
X IS THE SAME FOR MONAURAL 
AND BINAURAL LISTENING. 
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Fig. 4—X, the threshold of audibility for continuous spectrum sounds. 


CUMULATIVE CONTRIBUTION TO LOUDNESS IN PERCENT 





100 200 400 600 1000 2000 4000 6000 10,000 
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Fig. 5—Cumulative contribution to loudness of speech for a flat effective 
spectrum. 
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NOTES: 
(1) CURVE APPLIES FOR MONAURAL 
LISTENING. 


(2) EACH OF THE 50 BANDS OF UNIT 
IMPORTANCE CONTRIBUTES 2% 
OF THE LOUDNESS WHEN Zs = a 
CONSTANT ACROSS THE BAND 
OF INTEREST (100-5000 Hz). 


LOUDNESS, n,, IN LOUDNESS UNITS (LU) 
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EFFECTIVE LEVEL, Zs, IN DECIBELS 


Fig. 6—Loudness (n,) in a band of unit importance vs effective level (Z;) in 
that band. 


loudness of speech. The method was based on speech loudness tests 
conducted in 1935 and 1936.* 

Thus, there were available at least two methods, those of Steinberg 
and Munson, for computing the effects of changes in the response 
characteristics of telephone circuits on-speech loudness. These methods 
had been found to provide computed results which were reasonably . 
consistent with observed results from experiments on which the 
methods were based. A review of these experiments revealed sub- 
stantial inconsistencies between the different sets of data, leading to 
doubt concerning the generality of the procedures. Moreover, a sub- 
stantial amount of additional speech loudness data became available 
in 1939 and 1940. These considerations led to a review of the available 
speech loudness computation procedures and the various sets of sub- 
jective test data for purposes of arriving at a simple procedure, suitable 
for telephone engineering purposes, which would be reasonably con- 
sistent with all of the data. 


* Neither the computational procedure nor the tests were reported in the 
literature. 
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NOTE: Ze a 
THESE CURVES ASSUME A “REFERENCE” OBSERVER WHOSE 
““ 
““ 


THRESHOLD OF AUDIBILITY FOR A 1000 Hz TONE IS OdBt 


LOUDNESS IN SONES 


DATA FROM H. FLETCHER AND W.A. MUNSON (SEE REF. 22). 
THE CURVE ACTUALLY PLOTTED WAS OBTAINED FROM REF. 34 
AFTER ADJUSTING THE NUMBERS GIVEN THEREIN SO THAT A 
LOUDNESS OF 1 SONE CORRESPONDS TO A LOUDNESS LEVEL 
OF 40 PHONS. 


DATA FROM H. FLETCHER (SEE FIG. 137 OF REF. 27). 
FROM S.S. STEVENS (SEE TABLE 7 OF REFERENCE 37). 
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Fig. 7—Relation between loudness and loudness level. 


The speech loudness computation procedure resulting from the 
review referred to above is described in subsequent paragraphs. This 
procedure, developed in 1941, and the 1939-40 speech loudness data 
have not been previously reported, primarily for two reasons. First, 
the group concerned with the speech loudness project was essentially 
disbanded in late 1941 and assigned other work in support of the war 
effort. This group was not re-established so that its accomplishments 
could be reduced to a form suitable for publication in the literature. 
Secondly, there was no pressing need for the procedure until the loud- 
ness rating plan was proposed.’ Review initiated at that time has 
verified applicability of the procedure to telephone speech loudness 
problems. | | 


3.3 Development of the Speech Loudness Computation Method 


The speech loudness computation method evolved from a specific 
method devised by Fletcher and Munson for treatment of continuous 
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spectrum sounds, and uses as many of the steady-state definitions as 
are applicable.?* Review of the Fletcher-Munson method therefore 
serves as a convenient introduction. 


3.3.1 Steady-State Sounds—Definitions and Formulae 

Loudness, N, is defined as the intensive attribute of an auditory 
sensation. Loudness can be expressed in terms of a scale whose units 
have been found to agree with the common experience of observers 
estimating the intensive attribute of sounds. Ordinarily, the loudness 
of a sound is not specified in terms of units of a loudness scale, but, 
rather, in terms of its loudness level. (See Section 3.1 footnote.) 

Results of some experiments to determine the relation between 
loudness and loudness level are shown on Fig. 7. The solid curve, due 
to Fletcher and Munson, was first proposed in 1933.?? 

This curve was later adopted in an American Standard.** Further 
study by Fletcher and Munson resulted in a slightly different relation 
shown by the long dashed line. This curve, first reported in 1937,?* 
was later modified slightly.* The short-dashed curve, due to S. S. 
Stevens, resulted from effort directed at obtaining the best fit for all 
available data with the simplest possible relation.?**°°7 This relation 
is included in a USA Standard.** 

The loudness, N, for steady-state sounds can be computed from the 
masking spectrum, M, of the sound.* The loudness equation may be 
written as 


N = | ran dz. (1) 


F'(M), expressed in loudness units, is a function of the masking, and 
may be interpreted as expressing the intensity of the nerve stimulation 
at a particular position, x, along the basilar membrane. The quantity, 
x, is expressed in terms of the percent of total nerve endings that the 
maximum stimulation passes over as a stimulating tone is changed 
from lowest audible frequency to a frequency f, corresponding to 2. 
The product F(M) and dz represents the loudness contributed by a 
sound within the differential length dx and, consequently, the integra- 
tion represents the total nerve stimulation, or loudness, over the length 
of the basilar membrane. 

For continuous spectrum sounds, the masking, M,* at any fre- 


* Figure 137, Ref. 27. 

+See Ref. 23 and 39 for more detailed discussion of the concepts considered 
in this and subsequent paragraphs. 

+ M = B — Bo where 8 = the level of a single frequency tone which is just 
audible in the presence of a specified noise and fo = the level of the same tone 
which is just audible when the noise is absent. (See Ref. 39.) : 
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Ns IN LOUDNESS UNITS (LU) 


NOTES: 

1. LOUDNESS LEVEL, Ls,IS THE LEVEL, EXPRESSED IN cBt, OF A REFERENCE 
SPEECH SPECTRUM FOR WHICH THE COMPUTED LOUDNESS, IN LU, IS Ns. 
THE REFERENCE SPEECH SPECTRUM IS AVERAGE SPEECH (SEE FIG. 2) 
TRANSMITTED OVER A CIRCUIT THAT IS FLAT ON AN ORTHOTELEPHONIC 
BASIS WITH SHARP CUTOFFS AT 250 AND 4000 Hz. 


2. COMPUTATION OF Ns INCLUDED SPREAD-OF-LOUDNESS EFFECTS. 
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Fig. 8—Loudness of reference speech. 


quency is related to the effective level, Z, of the masking sound at 
the same frequency. Thus, the loudness formula (1) can also be 
expressed as follows: 


N= i Q(z) dv. (2) 


Interpretation of this formula is the same as that for (1) in that it 
represents the integrated nerve stimulation over the length of the 
basilar membrane. However, the intensity of the stimulation at any 
frequency is determined from the effective level of the continuous 
spectrum sound, Z, defined as follows: 


Z=B-xX (3) 
where 
Z = effective spectrum level (dB) 
B = pressure spectrum level (dBt) 
xX = a function (dBt), empirically determined from listening tests, 
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which reflects the fact that not all of the objectively measurable 
spectrum (B) is effective in producing loudness. 


The X function plotted on Fig. 4 may be considered to be the thresh- 
old of audibility for continuous spectrum sounds. Although this is 
not strictly correct, it lends physical interpretation to the function 
and, as such, facilitates understanding of the loudness computation 
technique.* 

The loudness, N, of a continuous spectrum sound can be computed 
by (2) dividing the audible frequency range into a number of suc- 
cessive frequency bands conveniently selected for computational pur- 
poses, (27) determining from the effective level and the importance 
of each band the number of loudness units contributed by the band, 
and (27) summing the loudness units over the audible range. This 
procedure, replacing the integration of (2), may be written as 


N = >on, At (4) 
where 
N = total number of loudness units, 
n, = loudness, Q(Z) from equation (2), integrated over a unit length 


of the basilar membrane, and 
Az = the number of unit lengths along the basilar membrane in- 
cluded in the computation band.t 


The quantity, n, , is a function of the effective level, Z, of that portion 
of the sound lying in the computation band. 


3.3.2 Speech Sounds—Definitions and Formulae’ 

Development of the speech loudness formula was based on the 
assumption that a formula of the same general type as (4) could be 
used, that is, the loudness of speech can be computed by dividing 
the speech spectrum into a number of successive bands selected for 


* Consider the case in which no masking occurs, ie., M = 0. Assuming M 
= Z, then Z = B — X = 0, and X = B, the pressure spectrum level of a noise 
which produces zero masking, hence zero loudness. X can thus be defined as the 
threshold of audibility for continuous spectrum sounds, useful for the purpose 
indicated, but not strictly correct since it has been found that at low values of 
- eo = 0), the relationship M = Z is not valid. (See equation (10-5) of 

ef. 27. 

t i. Ref. 23, the unit length was taken as 1 percent of the basilar membrane 
ength. 

* Many of the steady-state definitions of Section 3.3.1 apply also for speech. 
However, since some of the functions differ, the subscript s is used where 
appropriate to denote those specifically applicable to the speech case. 
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convenience of computation, determining the loudness contributed by 
each band, and summing the loudness over the audible range. This 
can be expressed as follows: 


= SS n, AS (5) 
where 


n, = total loudness produced by speech in a frequency band of unit 
importance 

the number of bands of unit importance included in a com- 
putation band. 


AS 


The quantity, 7, , is a function of Z, , the effective level of the received 
speech. Following the definition used in the formulation for continuous 
spectrum steady sounds, Z, is defined as follows: 


Z, = B,—X (6) 
where 


B, = sound pressure spectrum level of received speech (dBt)* 
and X is defined in equation (3). 

The bands of unit importance for equation (5) are so chosen that 
no matter where they are located on the basilar membrane, each 
contributes an equal amount to the total loudness when the effective 
level of the speech is independent of frequency, i.e., Z, = k (a con- 
stant) across the audible range.* When the speech band is divided into 
a number of bands suitable for computational purposes, each of these 
may contain several bands of unit importance. Thus, the loudness, n, , 
contributed by a band of unit importance must be multiplied by the 
number of such bands, AS, in the computation band. The value of 
AS used for any particular computation band depends on the width 
of that band (in Hz) and its location on the frequency scale. 

The summation of AS along the frequency scale represents the 
cumulative number of bands of unit importance. The rate of change 


*Speech varies rapidly in level with time. Accounting for this variability 
would have entailed development of complex formulae from the limited amount 
of applicable data available on speech. The approach followed was to develop 
the simplest possible method of computing speech loudness from the long-term 
average sound pressure spectrum of speech: where long term is understood to 
comprise a time interval which is long compared to the average syllabic interval. 

+ Selection of the bands is based on a flat effective spectrum for purposes of 
simplicity. Any other shape, Le., Z, + k, would require a different loudness 
versus Z, function for each band of unit importance. 


ELECTRO-ACOUSTIC RATING SYSTEM 2685 


of S with frequency may, therefore, be interpreted as showing the 
relative importance to loudness of equal effective levels of speech in 
different frequency regions. This interpretation of AS and the inter- 
pretation of n, as the loudness of a band of unit importance, based 
on the similarity of the quantities n, and Ax in equation (4) and n, 
and AS in'equation (5), are probably not exact because of the radically 
different nature of speech and steady-state sounds. These interpreta- 
tions should, therefore, be applied with caution, and regarded pri- 
marily as a point of view helpful in systematizing the basic data and 
in understanding the loudness formulations. 


3.3.3 Receiwed Speech Spectrum 

For any given telephone connection, the sound pressure level of the 
received speech [B, of equation (6)] and, hence, the loudness of the 
speech depends on (2) the speech spectrum applied at the talking end 
of the connection and (i) the loss of the connection as a function of 
frequency. These factors are taken into account in the speech loud- 
ness computation method by assuming a compromise speech spectrum 
and expressing the connection loss in orthotelephonic terms. 


3.3.3.1 Compromise Speech Spectrum. The compromise spectrum 
adopted is shown on Fig. 2 and is designated By. because the area under 
the curve integrates to a total sound pressure level of 90 dBt.* This 
spectrum was obtained by averaging and smoothing long average power 
measurements of continuous speech (including pauses between words) 
from 13 males and 12 females. Although some differences in spectral 
content were found between individual voices, the shapes of the spectrum 
curves were sufficiently alike to justify averaging across all voices. 
Moreover, measured spectra from other studies exhibit similar 
shapes.**~* 

Several experimenters have reported measurements of the long- 
term average sound pressure level of speech. Results of these experi- 
ments, referred to a point 2 inches in front of a talker’s lips, are given 
in Table I. (Selection of the 2-inch reference point is discussed in the 
next section.) 

The values are in fair agreement excepting the last row. Benson and 
Hirsh considered the apparent discrepancies at some length, con- 
cluding that values reported by Dunn and White and by Rudmose, 


* This spectrum is the same as that used in computing articulation index. See 
Fig. 2 of Ref. 39. 

+ Spectrum curves reported in Ref. 40 provided some of the data used in 
deriving this compromise spectrum. 
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TasBLE I—Lonc-TEerRmM AVERAGE SOUND PRESSURE LEVEL OF SPEECH 
AT 2 INCHES FROM TALKERS’ LIPS 


Males Females 
Pressure -Pressure 
Source No. (dBt) No. (dBt) 


Sivian—Ref. 45 5 

Dunn and Farnsworth—Ref. 44 1 — — 
Dunn and White—Ref. 40 6 90 .8* 5 88 .0* 
Rudmose, et al.—Ref. 42 7 — 

Benson and Hirsh—Ref. 438 5 81. 7* 5 


* Values reported were converted to apply at 2 inches by assuming an equivalent 
point source 0.6 cm behind the talker’s lips. See Ref. 44. 


et al., “approximate more nearly monitoring levels (peaks of energy 
probably corresponding to vowel sounds) than to an actual average.’’** 

There is thus some uncertainty regarding the sound pressure level 
at 2 inches from a real voice. For present purposes, the level is as- 
sumed to be 90 dBt, an approximate average for the majority of the 
data (specifically excludes the Benson-Hirsh data). As will become 
evident, the exact value assumed for the level at 2 inches does not 
significantly affect the speech loudness computational procedure as 
long as the spectrum shape (see Fig. 2) is not changed. 


3.3.3.2 Orthotelephonic Transmission. The concept of orthotelephonic 
transmission is based on relating telephone and face-to-face conver- 
sation.’° Specifically, orthotelephonic transmission for present purposes 
implies reproduction by a telephone system of speech sounds which are 
indistinguishable from those received with an air transmission system. 
Conditions applicable for the latter case are negligible noise and acoustic 
reflections when listening monaurally at a distance 1 meter from the lips 
of the talker, the listener facing the talker. An orthotelephonic system 
is thus one that provides the required orthotelephonic reproduction of 
the speech sounds. 

Application of the orthotelephonie concept to telephone connec- 
tions is detailed in Fig. 3. The definitions are given here as a matter 
of convenience, and will be considered where appropriate in later 
discussion. Typical orthotelephonic amplitude responses of overall 
telephone connections employing specific telephone set types are given 
in Ref. 16. Such responses vary widely depending on the telephone 
set and transducer types used. 

One point worth noting is that the orthotelephonic transmitting 
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response definition of Fig. 3 is in terms of the sound pressure level at 
2 inches from a talker’s lips whereby strict conformance with the 
orthotelephonic definition would require expression in terms of the 
1-meter sound pressure level. This change was made for two reasons: 
(2) the 1-meter reference involves a long acoustic path which is 
ascribed to the transmitter; (77) measurements at 2 inches are more 
reproducible since they are less likely to be affected by characteristics 
of the surrounding environment, e.g., reverberation. However, studies 
show that under free-field conditions the sound pressure spectra at 
2 inches (approximately 5 cm) and at greater distances have about 
the same shape,*** but differ in absolute level by about 25 dB for 
the 1-meter and 2-inch positions.*# 


3.3.4 Derivation of n, and S Functions 

The speech loudness formula [equation (5)] involves the two 
quantities n, and S. As noted earlier, n, is a function of Z, and is the 
loudness due to that portion of speech contained in a frequency band 
of unit importance; S is a function of frequency which shows the 
relative importance to total loudness of equal effective levels of speech 
in different frequency regions. These functions, though somewhat 
interrelated, may be thought of as providing level weighting and fre- 
quency weighting respectively. 

Data from three tests were used in deriving these functions. The n, 
function was based on tests by Munson the results of which show 
how loudness varies as a function of received speech level. Data 
obtained by Steinberg and by Van Wynen were used to derive the S 
function. These data show the manner in which loudness depends on 
frequency content of the received speech signal. Other data were 
available, but were not used because (2) there were gross inconsisten- 
cies between the data and data from other tests and/or (2) the test 
system and conditions were not reported in sufficient detail to permit 
utilization of the test results. 

An important factor in deriving the n, and S functions is Z,, the 
effective level of the received speech. [See equation (6).] Effective 
levels for the test results used in the derivation are shown on Fig. 9. 
The reference effective spectrum curve, consisting of the compromise 
speech spectrum, Bg), of Fig. 2 minus the threshold function, X, of 
Fig. 4, is also plotted on Fig. 9. This function, utilized in the speech 
loudness computation technique later, is assumed to represent the 
average case for a large number of talker-listener pairs conversing 
over a telephone system with (z) orthotelephonic response and (7) 25 
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“i —--— MUNSON TESTS (1935-1936) 


(SEE SECTION 3.3.4.2) 

VAN WYNEN TESTS (1940) 

(SEE SECTIONS 3.3.4.1 AND 3.5.2) 
STEINBERG TESTS (1924) 

(SEE SECTIONS 3.3.4.1 AND 3.5.1) 
REFERENCE EFFECTIVE SPECTRUM EQUALS 


COMPROMISE SPEECH SPECTRUM (FIG. 2) 
MINUS THRESHOLD (FIG. 4) = Bgg— X 


EFFECTIVE SPECTRUM LEVEL IN DECIBELS 


NOTE: THE EFFECTIVE SPECTRUM LEVEL CURVES FOR THE 
LOUDNESS TESTS HAVE BEEN ADJUSTED SO THAT THE 
LONG-TIME AVERAGE SOUND PRESSURE IS ABOUT 80 dBt 
IN ORTHOTELEPHONIC TERMS. 





100 200 400 600 1000 2000 4000 6000 10,000 
FREQUENCY IN Hz 


Fig. 9—Effective spectrum level curves. 


dB of flat gain relative to a true orthotelephonic system. The function 
is given here to illustrate the difference between the effective spectrum 
level curve for a system providing orthotelephonic response and the 
effective spectrum level curves for the Steinberg and Van Wynen test 
systems. 

Because n, and S functions were interrelated, their derivations were 
carried out concurrently by means of a series of successive approxima- 
tions. For simplicity, the two functions are treated separately in suc- 
ceeding sections, and the derivation is described in greatly simplified 
terms. The actual derivation involved obtaining a first estimate of 
the S function, using the Steinberg and the Van Wynen data (includ- 
ing the respective loudness law exponents—see later discussion), for 
correcting the actual Z, to a constant Z,. This S function was then 
applied to the Munson data in order to obtain a curve of total loud- 
ness versus speech sound pressure level for a 100-5000 Hz band, the 
band of interest in the speech loudness computation method. The 
ordinate and abscissa of this ‘curve were then corrected to reflect 
respectively the loudness in a band of unit importance and the Z, 
in a band of unit importance. 
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These first estimates of the S and n, functions were used to compute 
the loudness for a number of conditions. Computed results were com- 
pared to experimental results, and the functions modified as indicated 
by this comparison. This process was continued until computed values 
reflected adequate accuracy for engineering purposes. 


3.3.4.1 The S Function. The S function was derived from results of 
tests by Steinberg (see Section 3.5.1) and by Van Wynen* (see Section 
3.5.2). In these tests, speech received through high- and low-pass filters 
was balanced against undistorted speech. Test results important in 
the derivation are given in Fig. 10. The intersection of smooth curves 
drawn through the test results provides two items of information: 
(2) fso , the frequency above and below which the loudness contributions 
are equal;' (72) LL; , the number of dB by which undistorted speech 
must be reduced for a 50-percent reduction in loudness. 

By assuming a logarithmic relationship between loudness and effec- 
tive level (see Section 3.3.4.2), we can determine the percentage of 
loudness contribution as a function of frequency. Thus, 


Ve ar Les = k loge ja ; (7) 


where Z,,; and Zs are effective levels before and after a flat change 
in undistorted speech level, N,; and Nz» are corresponding loudness 
numerics, and k is a constant. For the Steinberg tests, we find, sub- 
stituting —9.1 dB for Zs. — Z,, and 1/2 for N./N, (50-percent loud- 
ness reduction), that 


while for the Van Wynen tests, k = 41.2 dB. 

We can express the relationship of equation (7) in the power func- 
tion form frequently encountered in current literature®*** by letting P 
represent the stimulus magnitude and setting it equal to Antilog Z/20. 
Then 


N= PF — (8) 


* Steinberg reports results for sensation levels = 100 dB and 39 dB.29 (Sensa- 
tion level is the level above threshold. See Ref. 21.) Only the 100-dB results 
were used in the derivation. Results for the 39-dB level were near the knee 
of the loudness curve (see Fig. 8 for example) and their inclusion would have 
substantially complicated the derivation. 

+t This assumes ideal filters. The filters used in the tests appear, for speech 
signals, to be sufficiently close to ideal to warrant making this assumption. 
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NOTE: 1. fg9 = FREQUENCY ABOVE WHICH AND BELOW 
WHICH, 50% OF LOUDNESS IS CONTRIBUTED. 


x 2. LLgg = LOUDNESS LOSS FOR 50% 
io DECREASE IN LOUDNESS. 
x 3. LPF = LOW PASS FILTER 
Ke HPF = BAND PASS FILTER. 
x 


STEINBERG TESTS (1924) 
SENSATION LEVEL = 100 dB 





a f5o = 820 Hz 
w LL5o = 9.1dB 
S LONG AVERAGE PRESSURE 
wu VAN WYNEN TESTS LEVEL (dBt) IS ABOUT 4.5dB 
2 SE GREATER THAN THE 
2 (1940) SENSATION LEVEL (dB). 
5 fs0 = 740 Hz 
- LLso = 12.4 dB H(Hz) HPFix) LPF (@) 
g LEVEL OF UNDISTORTED 125 7 
a SPEECH CORRESPONDING 250 - 
= TO OdB LOUDNESS LOSS 375 7 
- WAS ABOUT 80 aBt. 500 = 
f(Hz) HPF(A) LPF(O) 99 
250 51 
550 10. ‘ 
os 44 
1000 14.6 31 
1750 24 
2750 18 
0.7 
100 $ : . p 8 000 ‘ 2 4 ae 10,000 


FREQUENCY IN Hz 


Fig. 10—Filter test results. 


where N and P are as defined above and n = 20/k. The values of n 
are then 0.662 and 0.486 for the Steinberg and Van Wynen tests 
respectively. 

By applying equation (7) to the curves of Fig. 10, we can determine 
the loudness reduction corresponding to various values of loudness 
loss. For example, the Van Wynen tests show that for a high-pass 
filter with cutoff frequency = 550 Hz, Zs. — Zs1 = —10.3 dB. Using 
the equation, we find that the 550-Hz high-pass filter passes 56.2 
percent of the loudness, suppressing 43.8 percent. Proceeding in a 
similar manner for all of the data, the points shown on Fig. 11 were 
obtained. It is of interest to note that there appear to be no systematic 
differences between the Steinberg and Van Wynen test results even 
though the exponents referred to in the preceding paragraph are 
somewhat different. 

Because the effective spectrum level curves for tests were almost 
identical (see Fig. 9), the smooth curve of Fig. 11 was drawn to 
represent all of the data points. This curve shows the percent of total 
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loudness contributed by frequencies below the frequency of the 
abscissa for the effective spectrum level used in these tests. 

The S function required, showing the cumulative contribution to 
loudness when the effective spectrum level is flat with frequency, is 
plotted on Fig. 5. The function was obtained from the curve of Fig. 
11 by successive approximations as outlined in the introductory 
paragraphs of Section 3.3.4. 


3.3.4.2 Then, Function. The n, function, relating loudness and effec- 
tive speech level in a band of unit importance, was derived using test 
results obtained by Munson in 1935-1936. The test system and general 
methodology are described in Ref. 46. However, the test results have 
not previously been reported in the literature. 

In the Munson tests, undistorted speech was loudness balanced 
against a 1000-Hz test tone. Also, thresholds were determined for the 
1000-Hz and speech signals. Detailed results of the tests are given in 
Tables II and III. Average results are plotted on Fig. 12. 

The ordinate of Fig. 12 was converted to a loudness scale using the 
solid curve of Fig. 7. The resulting relation between ‘loudness and 


@ HPF 
x LPF 
O HPF 

A LPF VAN WYNEN 


} STEINBERG 


OA BPF 


NOTES: 

1, SEE SECTION 3.5.2 FOR DATA 
ON BANDPASS FILTERS. 

2. POINT FOR VAN WYNEN HPF 
= 1000 Hz IS PLOTTED AT 
1100 Hz TO AVOID CONFUSION. 


CUMULATIVE CONTRIBUTION TO LOUDNESS IN PERCENT 





100 200 400 600 1000 2000 4000 6000; 8000 
FREQUENCY IN Hz 


Fig. 11—Cumulative contribution to loudness of speech for the Steinberg and 
Van Wynen filter tests. 
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TaBLe II—Resutrs or Munson (1935-1936) SpEEcH LoupneEss TEstTs 
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tes of the 1000-Hz tone level (in dBt) judged to be equal 


ima 
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* Each observer participated in N tests, providing X1, X2, --+ , Xy esti 


loudness to the designated speech pressure level. 


in 
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TaBLeE II]—Resuuts or Munson (1935-1936) THRESHOLD TEsts 


Subject 
Designation 




















Speech 1000-Hz Tone 
X-dBt o-dB N X-dBt o-dB N 
12.7 0.8 4 2.2 0 1 
10.3 1.9 4 7.9 0 1 
16.0 3.1 4 5.6 0 1 
16.9 2:0 4 5.9 0 1 
12.4 0.6 3 — — — 
12.7 1.2 3 4.3 0 1 
10.4 1.3 2 4.5 0 1 
8.0 2.5 4 2.6 0 1 
12.7 0.3 4 5.7 0 1 
11.9 3.6 4 1.2 0 1 
11.9 0.9 3 1.6 0 1 

X-dBt S-dB X-dBt  S-dB 
12.4 2.5 5.1 3.2 


speech sound pressure level is shown on Fig. 13. (The ordinate of Fig. 
13 is in terms of Loudness Units—LU rather than sones. See footnote 
of Section 3.1.) 

In terms of equation (8), the curve of Fig. 13 at higher sound 
pressure levels represents an exponent of 0.455. This is in close agree- 
ment with the Van Wynen test results with exponent = 0.486 (see 


SOUND PRESSURE LEVEL OF 1000 Hz TONE IN dBt 


100 


90 


80 


70 


60 


50 


40 


30 


0 DATA POINTS (SEE 
TABLES II AND III) 


20 





0 10 20 30 40 50 60 70 80 90 100 
SOUND PRESSURE LEVEL OF SPEECH IN dBt 


Fig. 12—Loudness balance test results for a 1000-Hz tone and speech (Munson, 


1935-1936). 
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‘LOUDNESS IN LOUDNESS UNITS (LU) 





0 10 20 30 40 50 60 70 80 90 100 
SOUND PRESSURE LEVEL OF SPEECH IN dBt 


Fig. 183—Loudness (V,) vs speech sound pressure level for Munson (1935-1936) 
tests. 


previous section), with the “Derived” (from balancing speech against 
a 1-kHz tone) and “Direct” (from adjusting a speech signal to sound 
“half as loud” and “twice as loud” as a standard speech signal) from 
Ref. 32 with approximate exponents of 0.50 and 0.46 respectively, and 
with the derived curve of Fig. 52, Ref. 31, with an exponent of about 
0.51. The exponent does not agree with that from the Steinberg tests 
which was 0.662 (see previous section), with the exponent of about 
0.65 determined from the monaural versus binaural loudness matches 
of Ref. 32, with the exponent of 0.7 determined from the single 
phoneme magnitude estimation study of Ref. 38, and with the ex- 
ponent of 0.6 determined from the limited magnitude estimation work 
reported in Ref. 47. 

The relation of Fig. 18, applicable for wideband speech having the 
effective spectrum shown by the long-dashed curve of Fig. 9, was then 
modified to apply for narrow bands of speech, (bands of unit im- 
portance) and a flat effective spectrum. To do this, effective spectra 
were plotted for several of the loudness balance tests, e.g., see Fig. 9. 
For each of these, the frequency scale was divided into various num- 
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bers of bands of equal importance as determined from Fig. 5. It was 
then assumed that Z, was constant across each band of unit im- 
portance, and was equal in magnitude to the Z, at the center frequency 
of the unit band. This resulted in several “staircase” approximations 
to the smooth curve, each exhibiting a granularity depending on the 
number of bands selected. Study of the various spectra indicated that 
using 50 bands of unit importance, referred to as 2-percent loudness 
bands, represented a reasonable compromise between the conflicting 
requirements of (2) accuracy, improved by increasing the number of 
bands and thus obtaining a closer match between the “staircase” ap- 
proximations and the smooth curves, and (ii) simplicity, obtained by 
reducing the number of bands. 

The required function of n, versus Z,, shown on Fig. 6, was ob- 
tained from Fig. 13 by a series of successive approximations as out- 
lined in the introductory paragraphs of Section 3.3.4. Specific steps 
involved included (7) adjusting the ordinate and abscissa scales of 
Fig. 13 to represent a band of 100-5000 Hz (required making allow- 
ance for the fact that the effective level of the speech spectrum used 
in the Munson tests was not flat with frequency) and (iz) dividing 
the ordinate by 50 (to reflect. contribution by each 2-percent band) 
and by 2 (since the Munson test results and the solid curve of Fig. 7 
are both in terms of binaural listening while the case of usual interest 
in telephony is monaural listening). 


3.4 Computational Procedure 

Mechanics of the computation for any given telephone connection 
consists of two parts, (7) the computation of the loudness, N, , of the 
received speech, and (i) interpretation of the computed loudness 
numbers. The former is carried out utilizing a computational form 
based on Figs. 5 and 9. . 

The computational method to be described applies specifically to 
the monaural case, of primary interest in telephone problems. If 
speech is recelved binaurally, we assume that the ‘‘average” listener 
is symmetrical and compute the loudness separately for each ear, 
adding the resulting loudness numbers to obtain the total loudness. 
[See equation (5).] Thus, in cases where the speech signals delivered 
to the two ears are identical the resulting loudness is twice that 
obtained with one ear alone. 


3.4.1 Computation of Speech Loudness, N, 
The procedure followed in computing speech loudness is most read- 
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ily described in terms of the computation form shown in Fig. 14. The 
computation involves four steps: 


(1) Calculation of the received speech signal effective level, Z,, at 
a number of selected frequencies; 

(2) Determination of n, , the loudness in a band of unit importance, 
from Z, ; | . 

(3) Determination of the loudness in a computation band, the 
product of n, and AS; 


(4) Summation of the loudness numerics across the band of interest. 


Column 1 of Fig. 14 lists the frequencies at which the computations 
are made and columns 2 through 7 are used for determining Z,. The 
values entered in column 2, obtained from Fig. 9 (reference effective 
spectrum), apply for an acoustic talking level of 90 dBt. 

Columns 3, 4, and 6 are respectively: 7; , the orthotelephonic re- 
sponse of the transmitting element; R,; , the orthotelephonic response 
of the receiving element; and L, the response of the circuit inter- 
connecting the transmitting and receiving elements. (Refer to Fig. 
3 for appropriate orthotelephonic definitions.) In many cases of 
interest in telephony, the same transmitting and receiving elements 
are likely to be used in many computations. Column 5 of the form 
provides for combining responses of these elements with By) — X. 

The effective level of the received speech spectrum, Z,, resulting 
from combining entries of columns 5 and 6, is entered in column 7. 
The effective level in each computation band is converted to a loud- 
ness number for a band of unit importance using the curve of Fig. 6. 
The loudness numbers, entered in column 8, apply for frequency 
bands having 2-percent importance. The bands of Fig. 14 with mid- 
frequencies of 600 Hz and above have been selected to give 2-percent 
importance. Below this frequency, however, the 2-percent bands be- 
come very narrow (see Fig. 5), and circuit measurements at the re- 
quired frequencies provide a fineness in the response curves not re- 
quired for telephone rating purposes. Thus, wider bands are used 
and the greater importance of these bands allowed for by means of 
column 9 which shows the number of 2-percent bands in each compu- 
tation band. The loudness for a computation band is thus the product 
of column 8 and column 9 entries, and is entered in column 10. The 
total loudness, NV, , is obtained by summing column 10. 

In some cases, the computation frequency is not the midfrequency 
of the computation band according to Fig. 5. The midfrequencies 
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* APPLIES ONLY FOR A LONG-TERM SPEECH SOUND Ns= In,As = 
PRESSURE LEVEL OF 90 dBt AT 2” FROM THE 
TALKER’S LIPS. FOR OTHER TALKING LEVELS, 
CHANGE ENTRIES OF COLUMN 2 BY THE DIFFER- 
ENCE BETWEEN 90 dBt AND THE TALKING LEVEL 
OF INTEREST. 


Fig. 14—Speech loudness computation form. 


were rounded off as shown for purposes of convenience. For the 
lowest computation band, this rounding off overemphasizes that band’s 
contribution to the loudness, but this generally affects the total loud- 
ness number, N, , only slightly. 

The computation bands and’ the computation frequencies of Fig, 14 
were selected for convenience. These can be changed, e.g., to consider 
1/3-octave bands, by first locating the band limits on Fig. 5, then 
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determining (7) the center frequency for, and (7) the number of 
bands of unit importance contained in, the selected computation band. 
Appropriate values of By) — X can be obtained from Fig. 9. 

The procedure described above applies for cases where the loudness 
of a band of speech is determined by the effective level of the speech 
within that band. Masking experiments have shown that when the 
effective level of a continuous spectrum sound changes abruptly with 
frequency, e.g., a sharp cutoff filter applied to thermal noise, there 
may be masking in the suppressed frequency band due to energy in 
the passed frequency band. Since masking has been identified with 
loudness, it is reasonable to assume that the loudness of a band- 
limited speech signal will depend not only on the passed region but 
the suppressed region as well. This effect, termed spread-of-loudness, 
is considered in the computation by assuming that Z, does not decrease 
by more than 10 dB between adjacent 2-percent loudness bands.* If 
Z, (column 7 of Fig. 14) shows a more rapid change, a new Z, is en- 
tered which is exactly 10 dB below the Z, of the preceding or succeed- 
ing 2-percent band. 


3.4.2 Interpretation of Total Loudness, N, 

Loudness in loudness units (LU), although a ratio scale reflecting 
observers assessment of sound magnitude, has little physical signif- 
icance. It is thus desirable to express loudness in such a way as to con- 
vey a physical interpretation. This can be done by expressing the 
loudness of a sound in terms of the level of a reference sound which 
has the same number of loudness units. Thus, speech loudness can 
be expressed in terms of loudness level, i.e., the level of a 1000-Hz 
tone having the same loudness. Loudness level is an appropriate way 
to express speech loudness when comparing computed values with 
results of tests which involved loudness balancing speech and a 1000- 
Hz tone. Speech loudness (N,) can be computed using the form of 
Fig. 14, then with N, = Nyovo , the loudness level found from Fig. 7. 

Often speech through a test circuit is balanced against speech 
through some relatively distortionless reference circuit. In these cases, 
the loudness of the speech transmitted over the test circuit should 
probably be expressed in terms of the setting of the reference circuit. 


* The spread-of-loudness effect was determined from masking data obtained 
by Fletcher and Munson (Figs. 11 and 12 of Ref. 23 or Figs. 141 and 142 of 
Ref. 27) using narrow bands of noise. The masking data were plotted in terms 
of number of 2-percent bands below and above the nominal bandlimits of the 
noise. A straight line with a slope of 10 dB/band fitted the data with reason- 
able accuracy, thus permitting use of a simple rule for taking spread-of-loudness 
effects into account. 
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Thus, speech loudness can be expressed in terms of several different 
reference sounds. A single reference would be desirable to enable 
comparison of results from different tests. Loudness level would be 
suitable for such purposes. However, considering the different nature 
of speech and a 1000-Hz tone, a speech signal of specified character- 
istics might be a more suitable reference, one that would be more 
intuitively satisfying. 

The reference selected for the speech loudness computation pro- 
cedure is that due to transmitting average speech (Fig. 2) having the 
reference effective spectrum (Fig. 9) over a circuit that is flat on an 
orthotelephonic basis (Fig. 3) except for sharp cutoffs at 250 and 
4000 Hz. (These frequency limits, selected somewhat arbitrarily, were 
wide enough to include existing and planned commercial telephone 
circuits.) The loudness of speech from any telephone circuit can then 
be specified by the level of the reference that gives an equivalent N, ; 
the level, in turn, can be conveniently specified by its integrated rms 
pressure in dBt. The level of the reference speech spectrum, desig- 
nated as L,, is related to its loudness, N,, as shown on Fig. 8. This 
curve was derived using the form of Fig. 14, successively changing 
entries per column 2 thereon, and computing N, for each such change. 
It should be noted that (7) L, for the entries of column 2 is 90 dBt for 
the band 100-5000 Hz, 89.4 dBt for the band 250-4000 Hz and (it) 
spread-of-loudness was taken into account in the computation of N,. 


3.5 Comparison of Computed and Observed Results 

Observed results from six different subjective tests were compared 
to computed results obtained using the computation form of Fig. 14 
and the frequency response characteristics of the subjective test sys- 
tems. (Observed results from two of these tests were used in develop- 
ing the computational method.) Observed results comprised reference 
circuit attenuator settings or trunk settings obtained by observers 
when they loudness-balanced reference circuits and test circuits. 

Computed reference circuit settings were determined by first com- 
puting the loudness (N,) for each test condition and its corresponding 
reference condition (a reference circuit with its attenuator or trunk 
setting at the value to which observers adjusted to obtain loudness 
balance) using the test and reference circuit frequency response char- 
acteristics.* The computed loudness values (N,) were then converted 
to levels of the reference spectrum (LZ,). By comparing the L, of each 


* Frequency response characteristics used for the loudness computations dis- 
cussed In this paper are not reported. 
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test circuit with the L, of corresponding reference circuit at observed 
balance, computed settings were obtained. 

Computed and observed results are discussed and tabulated in Sec- 
tions 3.5.1 thru 3.5.6 and are plotted on Fig. 15. (For the latter, the 
L, for the reference circuit at balance is considered to be the observed 
value, the L, for the test circuit is the computed value.) Comparisons 
of computed and observed results are summarized in Table IV in 
terms of error distributions. These, together with Fig. 15 and tabu- 
lated values of ensuing sections, indicate that the computational 
method provides a high degree of accuracy. 


LEGEND 


STEINBERG LINEAR SYSTEM TESTS 
(1924) — SEE SECTION 3.5.1 


VAN WYNEN LINEAR SYSTEM TESTS 
(1940) — SEE SECTION 35.2 


VAN WYNEN TELEPHONE SET TESTS 
(1940) — SEE SECTION 3.5.3 


VAN WYNEN TIE LINE TESTS 
(1940) — SEE SECTION 3.5.4 


FRASER TESTS (1946) 
SEE SECTION 3.5.5 


CCIF TESTS (1939) 
SEE SECTION 3.5.6 


TEST Ls IN dBt (COMPUTED) 





20 30 40 50 60 70 80 90 100 110 
REFERENCE Lg IN dBt (OBSERVED) 


Fig. 15—Comparison of reference (observed) and test (computed) speech 
loudness. 
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TasBLeE [V—SuMMARY OF COMPARISON OF COMPUTED AND OBSERVED 


























RESULTS 
Computed Minus 
Observed 
Detailed 
Test Designation Results In Average-dB o-dB 
Steinberg tests (1924) 
Sensation level = 100 dB 3.5.1 1.4 1.1 
Sensation level = 39 dB 3.5.1 4.0 2.2 
Van Wynen linear 
system tests (1940) 3.5.2 —0.8 2.1 
Van Wynen telephone set. 
tests (1940) Bas) 0.3 1.1] 
Van Wynen tie line 
tests (1940) 3.5.4 —0.4 0.9 
Fraser tests (1940) 3.5.5 0.2 0.9 
CCIF tests (1939) 3.5.6 0.8 iat 











3.5.1 Steinberg Linear System Tests (1924) 

In 1924, J. C. Steinberg conducted loudness balance tests with a 
system which utilized linear transducers interconnected by either of 
two channels, a test channel into which various filters were inserted 
and a reference channel which provided essentially distortionless 
transmission.?*?9 Observers adjusted an attenuator in the reference 
channel to obtain loudness balance for each of a number of the test 
networks. 

Computed and observed loudness losses are given in Table V. For 
purposes of the table, loudness loss is equal to the amount of flat loss 
by which the reference circuit was varied from its reference setting 
(for the particular sensation level) so that speech through the 
reference channel had the same loudness as speech through the test 
channel. Observed values given for the 100-dB sensation level were 
used in deriving the S function. (See Section 3.3.4.1.) 

For the reference channel, L, with the 100-dB sensation level 
reference setting was 107.1 dBt; with the 39-dB sensation level 
reference setting, L, = 42.6 dBt. Observed and computed loudness 
losses per the table may be converted to DL, values for the reference 
channel by subtracting entries in the “observed” column from the 
above reference values; L, for the test channel may be obtained by 
subtracting entries in the “computed” column, for the appropriate 
test network, from the reference values given above. Thus, the ref- 
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erence channel L, = 102.1 dBt and the test channel LZ, = 100.1 dBt 
for a 500-Hz high-pass filter and a 100-dB sensation level. 

Agreement between computed and observed values is reasonably 
good for the most part. In general, the agreement becomes poorer with 
increasing distortion and is poorer at the lower sensation levels. 
Neither of these affects is particularly restrictive from a telephone 
engineering standpoint since they represent extremes which a well- 
engineered telephone plant will seldom approach. 


3.5.2 Van Wynen Linear System Tests (1940) 


In 1940, K. G. Van Wynen conducted loudness balance tests using 
a system equipped with linear transducers. (These tests were not 


TaBLE V—ComMPUTED AND OBSERVED LOUDNESS Loss VALUES FOR THE 
STEINBERG LINEAR Sysrem Tests (192+) 



















































Sensation Level = 100 dB Sensation Level = 39 dB 
Loudness Loss (dB) Loudness Loss (dB) 
= F = 
Test. A B E-—D 

Network* Observed |Computed (dB) 
.125 Hy HPF 0.6 2 4 0.2 
250 2.0 2.7 0.7 0.4 0.6 

375 3.6 5.0 1.4 —0.2 1.7 

500 5.0 7.0 2.0 1.0 3.8 

625 7.6 9.0 1.4 3.0 2.8 

750 8.5 10.8 2.3 0.6 6.8 
1000 10.5 13.8 3.5 3.2 6.6 
1250 13.2 16.2 3.0 4.5 6.8 
1500 16.6 18.5 1.9 5.3 7.7 

| ns ——— 
Avg. = 1.7 = 4.1 
=1]1.1 = 2.8 
ae i | Bal i‘ 

750 Hz LPF | 9.9 12.6 2.7 9.7 14.4 4.7 
1000 7.4 9.4 2.0 7.3 11.6 4.3 
1250 6.4 8.0 1.6 5.3 10.0 4.7 
1500 4.4 6.5 2.1 3.7 8.6 4.9 
2000 3.0 4.3 1.3 0.8 6.6 5.8 
2500 2.4 20 0.1 1.4 4.3 2.9 
3000 1.9 1.5 —0.4 0.2 3.0 2.8 
4000 0.7 0.4 —0.3 —0.5 0.4 0.9 

Avg. = 1.1 Avg. = 3.9 
c= 1.1 c= 1.5 
to _! 





* HPF = high-pass filter; LPF = low-pass filter. Entries designate nominal 
cutoff frequencies of the filters. See Ref. 48 for description of the filters which were 
probably used in the Steinberg tests. 
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TasLte VI—ComPutTeD AND OBSERVED RESULTS FOR THE VAN WyNEN 
LINEAR System Tests (1940) 

















Observed L, — dBt. Computed 
Reference Reference | Computed 
Circuit Reference Circuit: Minus 
Distorting Attenuator Test Circuit (at | Attenuator] Observed- 
Network* Setting-dB | Circuit | Balance) (Setting-dB dB 
2700 Hz LPF 24.2 81.4 80.8 23 .6 —0.6 
1750 Hz LPF 28.3 78 .3 76.6 26.6 —1.7 
1000 Hz LPF 31.3 74.1 73.4 30.6 —0.7 
250 Hz HPF 25.6 79.1 79.2 25.7 0.1 
550 Hz HPF 30.8 73.1 74.0 31.7 0.9 
1000 Hz HPF 36.4 67.6 68.1 36.9 0.5 
250-2700 Hz BPF 29.0 75.0 76.0 30.0 1.0 
550-1750 Hz BPF 38 .9 62.5 65.6 42.0 3.1 
1000 Hz Peak 
15 dB 26.8 80.2 78.1 24.7 —2.1 
30 24.2 81.9 80.8 23.1 —1.1 
56 34.5 70.3 70.1 34.3 —0.2 
700 Hz Peak 
30 dB. 25.8 81.4 79.0 23.4 —2.4 
2000 Hz Peak 
30 dB 28.7 82.8 76.2 22.4 —6.6 
Falling Loss 25.6 79.2 79.2 25.6 0 
Rising Loss 26.4 80.6 78.5 24.3 —2.1 
Avg. = —0.8 
c= 2.1 








* LPF = low-pass filter, HPF = high-pass filter, BPF = bandpass filter; numbers 
denote nominal filter cutoff frequencies. ‘“Peak’’ denotes a resonant circuit. defined by 
the resonant frequency (minimum loss of circuit occurred at resonance) and a dB 
value indicative of the Q, e.g., 56 dB indicates a high Q, 15 dB indicates a low Q. 
“Falling Loss’’ designates a network whose loss decreased monotonically with in- 
creasing frequency while for the “Rising Loss’? network, the loss increases mono- 
tonically with frequency. 


reported in the literature.) The transducers were interconnected via 
one of two channels, the test channel into which various test networks 
were introduced and a reference channel which was essentially dis- 
tortionless. Switching between channels was controlled by the ob- 
servers who, for each network tested, adjusted an attenuator in the 
reference channel to produce equal loudness. Twelve talker-listener 
pairs participated in the tests. 

Computed and observed attenuator settings are given in Table VI. 
The computed setting was obtained by noting the difference in Ls 
for the test circuit and the reference circuit at balance, and modifying 
the observed setting by this amount.” 

* The high- and low-pass filter results, expressed in different terms, were used 


in deriving the speech loudness computational method. See Section 3.3.4.1 and 
Fig. 10. 
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The agreement between observed and computed reference circuit 
settings is quite good, the error being less than 2 dB in ten of the 
fifteen cases. Those cases showing the larger errors represent severe 
distortion conditions which would be approached rarely, if ever, in 
a well-engineered telephone plant. 


3.5.3 Van Wynen Telephone Set Tests (1940) — 


In 1940, K. G. Van Wynen conducted tests in which different types 
of telephone sets were compared on a loudness basis. (These tests 
were not reported in the literature.) The laboratory system used con- 
sisted essentially of two separate acoustic-to-acoustic channels. One 
of these was equipped with reference transmitting and receiving 
telephone sets interconnected by a test network which included an 
adjustable attenuator. (The telephone sets were of the type described 
in Ref. 16.) The other channel used each of two different types of 
telephone sets, designated Test Telephone Sets A and B, with fixed 
connecting circuitry. (Test Telephone Sets A were of the type de- 
scribed in Ref. 15; Test Telephone Sets B were of a design similar to 
the telephone sets used with the Working Reference System described 
in Section 2.2 of this paper.) These sets differed from each other and 
from the reference telephone set in that they had transducers exhibit- 
ing different frequency response characteristics. 

Results of the tests are expressed in terms of trunk loss, variable 
and controlled by the observer, required in the reference channel to 
deliver speech equal in loudness to that from the test channel with 
a fixed trunk loss. (Twelve talker-listener pairs participated in the 
tests.) Computed and observed trunk losses shown in Table VII are 
in close agreement. The computed trunk loss was obtained by noting 
the difference in L, for the test channel and the reference channel, 
and modifying the observed trunk loss by this difference. 


3.5.4 Van Wynen Tie Line Tests (1940) 


In 1940, K. G. Van Wynen conducted loudness balance tests with 
laboratory systems which simulated selected circuit conditions char- 
acteristic of telephone connections between Bell Telephone Labora- 
tories and the American Telephone and Telegraph Company. These 
connections involved circuits which were called “tie lines,” and thus 
the designation “tie line tests.” 


_ TasBue VII—CompureD AND OBSERVED LOUDNESS LEVELS FOR THE VAN WYNEN TELEPHONE SET TESTS 











(1940) 
Computed 
Observed Trunk Loss at Minus 
Loudness Balance-dBt L, — dBt Computed Observed 
Test | | —) Reference Reference 
Telephone Test Reference Test Reference Trunk Trunk 
Set Network* Channel Channel Channel Channel Loss-dB Loss-dB 
A 4000 Hz LPF 5 16.8 76.3 76.8 17.3 0.5 
4000 Hz LPF 25 35.2 55.3 57.7 37 .6 2.4 
B 4000 Hz LPF 5 11.4 82.7 82.2 10.9 —0.5 
4000 Hz LPF 25 81.5 62 .2 61.7 31.0 —0.5 
Rising Loss 5 11.7 82.5 81.8 11.0 —0.7 
Rising Loss 25 31.2 61.5 62.2 31.9 0.7 
Avg. = 0.3 
o =1.1 








* 4000 Hz LPF = low-pass filter with a nominal cutoff frequency of 4000 Hz; Rising Loss = increasing loss with increasing fre- 
quency. The network was switched (by the observer) between reference and test channels and loudness balance achieved by adjusting 
an attenuator in the reference channel connecting circuit. 

t Values are the 1-kHz losses (900-ohm source and termination) of the reference and test channel connecting circuits. 
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Two test systems, designated “overall” and ‘“sidetone,” were used.* 
Each of these permitted comparison between test networks and either 
of two reference networks. (The transmitter and receiver used in these 
tests were of a design described in Ref. 17.) The observers’ task was 
to adjust an attenuator in the reference channel so that speech via 
this channel and the test channel were equally loud. Twelve talker- 
listener pairs participated in the tests. 

Results of the tests expressed in terms of reference circuit settings 
are shown in Table VIII. Corresponding values of L, (dBt) are given 
as parenthetical entries; those in the “Observed” columns apply for 
the reference channel while those in the “Computed” columns apply 
for the test channel. Computed settings were obtained by noting the 
difference between L, for the test channel and Z, for the reference 
channel, and modifying the observed circuit setting by this difference. 
Computed and observed values are in close agreement. 


3.5.5 Fraser Tests (1946) 

In 1946, J. M. Fraser conducted tests in which two different 
acoustic-to-acoustic channels were compared on a loudness basis. 
(These tests were not reported in the literature.) The test channel 
comprised test telephone sets of the type which were the Bell System 
Standard at that time,’®17 and connecting circuitry which simulated 
selected telephone connections. These simulations, arbitrarily desig- 
nated in column 1 of Table IX, exhibited different amounts of increas- 
ing loss with increasing frequency. The reference channel was the 
Master Reference System discussed in Section 2.1 of this paper. 

Results of the tests are expressed in terms of the Master Reference 
System trunk (600-ohm attenuator) setting, variable and controlled 
by the observer, required to achieve loudness balance between the 
test channel and the reference channel. Twelve talker-listener pairs 
participated in the tests. Computed and observed reference trunk 
settings shown in Table IX are in close agreement. 


3.5.6 CCIF Tests (1939) 


In 19389, the CCIF? conducted loudness balance tests on selected 
simulated telephone connections sent them by the American Telephone 


* These designations reflect the frequency response characteristics simulated 
and not testing conditions. Specifically, those with the “sidetone” system were 
on a listening only basis, and did not involve a talker listening. to himself via 
the system. Sidetone is discussed in Refs. 33, 49, and 50. 

+See Section 2.1 of this paper. 


TABLE VIII—Compvurep AND OBSERVED LOUDNEss LOSSES FOR THE VAN WYNEN TIE LINE TEsts (1940) 
































DL Reference Peaked Reference 
Circuit Setting-dB Computed Circuit Setting-dB Computed 
Test Minus Minus 
Network* Observed | Computed Observed-dB Observed Computed Observed-dB 
Overall 
20 DL — —_— — 6 .3(76.0) 7 .3(75 .0) 1.0 
20 DL —_ —_ —_ 6.8(75.5) 7 .3(75.0) 0.5 
20-3000 LPF 21.2(73.8) | 20.8(74.2) —0.4 6 .9(75.4) 8.1(74.2) 1.2 
20-2400 LPF 22 .5(72.5) | 22.7(72.3) 0.2 9 .6(72.7) 10.0(72 .3) 0.4 
20-1900 LPF 24.7(70.3) | 25.7(69.3) 1.0 13 .8(68 .5) 13 .0(69 .3) —0.8 
20-1900 LPF 26 .0(69.0) | 25.7(69.3) —0.3 12 .9(69 .4) 13 .0(69 .3) 0.1 
20-RL1 220(75.0) | 21.6(73.4) Sie 8.7(73 6) 8.9(73.4) 0.2 
20-RL2 23.3(71.7) | 22.9(72.1) —0.4 10.4(71 .9) 10 .2(72.1) —0.2 
20-RL3 24.4(70.6) | 23.3(71.7) af 11.3(71.0) 10 .6(71.7) —0.7 
z= —0.2 z= 0.1 
o= 0.6 ¢= 0.6 
Sidetone 
ef. DL 0.7(94.8) |—1.3(96.8) —2.0 —13.7(96.2) | —14.3(96.8) —0.6 
—10 DL 11.5(83.0) | 10.7(83.8) —0.8 —1.6(83.3) —2.1(83.8) —0.5 
—20 DL 19.2(75.3) | 19.0(75.5) —0.2 5 .4(77 .0) 6 .9(75.5) 1.5 
R900 5.4(89 .8) 6 .2(89 .0) 0.8 —6.9(89.1) —7 .0(89 .2) —0.1 
R1300 5.4(90 .0) 4.4(91.0) —1.0 —8 .6(90.9) —8.7(91.0) —0.1 
R1800 5 .2(89 .9) 3.1(92 .0) —2.1 —8.2(89.7) | —10.5(92.0) —2.3 
R8200-A 12.2(82.4) | 11.1(83.5) -1.1 —0.8(82.7) —1.6(83.5) —0.8 
R3200-B 12.2(82.5) | 10.1(84.6) —2.1 —2.1(83.9) —2.8(84.6) —0.7 
RL4 5.7(89 .6) 5 .0(90..3) —0.7 —8 .2(90.7) —7.8(90.3) +0.4 
z= 1 z= -—0.4 
c= 0.9 o= 1.0 
Overall: ¢ = —0.4 
c= 0.9 





* The number preceding the overall network designation defines the value of an attenuator associated with that network. For the 
DL network, this number (20) was the setting of an attenuator associated with the network when it was balanced against the peaked 
reference. The DL reference and the “20 DL”’ overall were the same circuit (incorporating in essence a variable attenuator) and differed 
from the DL circuits for the sidetone tests. The response of all DL networks showed decreasing loss with increasing frequency. LPF = 
low-pass filters with a nominal cutoff frequency (in Hz) noted. The RL networks had differing amounts of increasing loss with in- 
creasing frequency. Networks designated R were resonant circuits with high values of loss at the indicated resonant frequencies. 
Networks R3200-A and R3200-B differed in that the former had a slightly greater Q. 
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TaBLE [X—CoMPUTED AND OBSERVED LOUDNESS LEVELS FOR THE 
TEstT AND REFERENCE CHANNELS OF THE FRASER TEstTs (1946) 


Observed Computed 

Reference L, — dBt Reference Computed 
Trunk Se Trunk Minus 

Test Setting Test Reference Setting Observed 
Network (dB) System System (dB) (dB) 
A 33.3 67.5 69.2 35 1.7 
B 39.9 63.3 62.8 39.4 —0.5 
C 41.5 60.8 61.2 41.9 0.4 
D 43.0 58.8 58.9 43.1 0.1 
E 40.3 61.6 61.9 40.6 0.3 
F 43.8 59.7 58 .4 42.5 —1.3 
G 40.6 60.9 61.6 41.3 0.7 
Avg. = 0.2 
o =0.9 


and Telegraph Company. (These tests were not reported in the litera- 
ture.) These simulations were a subset of those discussed in Section 
3.5.4 of this paper. 

The CCIF tests involved two steps. First, the simulated connection 
designated “DL Reference” with 20 dB of added loss and the SFERT* 
were loudness balanced. Twenty talker-listener pairs, formed from a 
trained test crew of five persons, each provided seven balances. Results 
comprised settings (in dB) of the SFERT trunk (600-ohm attenuator) 
required to achieve balance; the average value was 30.9 dB.t 

The L, for the DL Reference (with 20 dB added loss) was com- 
puted using frequency response characteristics for the DL Reference 
of Section 3.5.4 while the L, for the SFERT (with an attenuator 
setting of 30.9 dB) was computed from response characteristics for 
the reference system of Section 3.5.5. The respective values were 74 
dBt and 71.6 dBt. That is, the CCIF loudness balance tests showed 
these circuits to be equal while the speech loudness computational 
method indicates that they differ by 2.4 dB. Probably this difference 
is due to the system response characteristics (primarily transducers) 
used in the computations, and not to the computational method itself 
being in error. In the computations, corrections reflecting differences 
in specific transducers of the same type were applied where known, 


* See Section 2.1 of this paper 

tAs noted in Section 2.1, the SFERT line attenuator setting (in dB) re- 
quired for loudness balance with a given system under test is, by definition, the 
Reference Equivalent (in dB) of that system. Thus, the Reference Equivalent 
of the 20 DL Reference connection is 30.9 dB. 
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but these were not known in all cases, and average responses were 
necessarily used in these cases. This argument is supported to some 
extent by the internal consistency for the tests of Section 3.5.4, the 
tests of. Section 3.5.5, and the CCIF tests. 

The DL Reference was then used as a secondary reference against 
which ten simulated telephone connections were loudness balanced. 
Computed and observed results given in Table X are in close agree- 
ment. 


3.5.7 Comparison of Computed Loudness Losses and Reference Equiv- 

. alents — 

Reference Equivalents are, by definition, loudness ratings of tele- 
phone connections obtained using the SFERT (and, by inference, the 
MRS and NOSFER). The speech loudness computational method 
provides loudness ratings in orthotelephonic terms. Since both are 
based on speech loudness, information from the preceding sections 
can be used to derive a correction factor which would permit trans- 
lating ratings between the two frames of reference. (Such translation 
depends on bandwidth characteristics of telephone connections and 
therefore will depend on, among other things, the particular trans- 
ducer types involved.) Tables XI and XII summarize the appropriate 
information. 

The average conversion factor from Table XI is 12.7 dB while that 
from Table XII is 15.0 dB, a difference of 2.3 dB. Some possible 
reasons for this difference were discussed in Section 3.5.6. In addition, 
the observer groups differed for the Fraser and CCIF tests, and there 
were probably some minor differences in test procedure. 


IV. LABORATORY MEASURING SYSTEM 


The speech loudness computational procedure could be realized as 
a laboratory measuring system in any one of several forms. The 
“deal” system would implement exactly the various functions of the 
computational procedure. The system sound source would energize 
- the transmit end of a telephone connection with an acoustic. signal 
having the amplitude characteristic of Fig. 2 and a time rate-of- 
change based on Fig. 5. The signal received at the other end of the 
connection would be applied to a measuring subsystem containing a 
‘network with a response according to Fig. 4. System operation so far 
corresponds to computing the effective spectrum, Z,. (See column 7 
~ of the computation form on Fig. 14.) 


TABLE X—LoUpDNESsS BALANCE RESULTS WITH DL REFERENCE CONNECTION AS REFERENCE 








OTL? 





Observed DL Computed 
Reference LL, — dBt DL Reference } Computed 
Test System Trunk Trunk Minus 
| ————_—__—_—— Setting Test DL Reference Setting Observed 
Type Designation* (dB) System System (dB) (dB) 
Overall 20-1900 LPF 23.5 68.3 70.5 25.7 2.2 
20-2400 LPF 21.7 71.3 72.3 22.7 1.0 
20-3000 LPF 20.8 73.2 73.2 20.8 0.0 
20-RL2 22.3 71.1 71.7 22.9 0.6 
20-RL3 22.2 70.7 71.8 23.3 1.1 
Avg. = 1.0 
¢ =0.7 
Sidetone Ref DL 13.5 82.8 81 11.7 —1.8 
—10 DL 23.7 69.8 70.8 24.7 1.0 
—20 DL 31.8 61.5 62.7 33 .0 1.2 
R900 19.0 75.2 75.5 19.3 0.3 
R3200-A 22.9 69.5 71.6" 25.0 2.1 
Avg. = 0.6 
¢ =1.3 
Overall: @ = 0.8 
o =1.1 





* See footnote of Table VIII. 
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TABLE XI—RELATION BETWEEN REFERENCE EQUIVALENTS (RE) AND 
LoupngEss Loss (LL) FRoM THE FRASER TESTS (SECTION 3.5.5) 


Test RE* L* LLt RE Minus 
System (dB) (dBt) (dB) LL (dB) 
A 33.3 67.5 21.9 11.4 
B 39.9 63.3 26.1 13.8 
C 41.5 60.8 28.6 12.7 
D 43.0 58.8 30.6 12.4 
E 40.3 61.6 27.8 12.5 
F 43.8 59.7 29.7 14.1 
G 40.6 60.9 28.5 12.1 
Avg. = 12.7 

c= 0.9 


* Columns 2 and 3 of Table IX. 
t 89.4 minus entries of column 3. See last paragraph of Section 3.4.2. 


The remainder of the measuring subsystem would consist of a device 
to reflect the spread-of-loudness effect, a device having the law of 
Fig. 6, an integrating circuit, and, finally, a display mechanism re- 
flecting the law of Fig. 8. 

The ‘‘ideal” system could be modified, without changing its essen- 


TaBLE XJI—RELATION BETWEEN REFERENCE EQUIVALENTS (RE) AND 
Loupnress Loss (LL) From CCIF Trsts (Srcrion 3.5.6) 


Test RE* Lt LLt RE Minus 

System (dB) (dBt) (dB) LL (dB) 
20-DL REF 30.9 74 15.4 15.5 
20-1900 LPF 34.4 68 .3 21.1 13.3 
20-2400 LPF 32.6 71.3 18.1 14.5 
20-3000 LPF 31.7 73.2 16.2 15.5 
20-RL2 33 .2 rae! 18.3 14.9 
20-RL3 33.1 70.7 18.7 14.4 
REF DL 24.4 82.8 6.6 17.8 
—10 DL 34.6 69.8 19.6 15.0 
—20 DL 42.7 61.5 27.9 14.8 
R900 - 29.9 75.2 14.2 15.7 
R3200-A 33.8 69.5 19.9 13.9 
Avg. = 15.0 
o = 1.1 





* Derived from column 3 entries of Table X and relation that DL Reference with 
20-dB trunk setting had a reference equivalent of 30.9 dB. 

t From column 4 of Table X and text of Section 3.5.6. 

t 89.4 minus entries of column 3. See last paragraph of Section 3.4.2. 
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tial attributes, by combining the “X” network response with that of 
the signal source. The source output would then have an amplitude 
characteristic as shown by the dash-dot curve of Fig. 9. (See column 
2 of the computation form on Fig. 14.) 

Realization of the system design outlined above is. limited by a 
number of practical considerations. Exact implementation would re- 
quire using an artificial mouth and an artificial ear which duplicated 
their human counterparts within the context of definitions given in 
Fig. 3. Such are not available. However, simpler devices are available 
which are probably adequate for purposes of many telephone engi-° 
neering problems. 

Secondly, the source signal does not incorporate the dynamic 
properties of speech. This is probably not important as long as the 
telephone connection under test is composed of linear elements. How- 
ever, measurements on connections incorporating ‘nonlinear elements, 
particularly carbon transmitters, may be in error. Carbon transmitters 
in real use are activated by real speech; testing such with an applied 
signal of the type referred to above might well result in different 
device operating characteristics. This matter is under study. 

In this paper, we consider a system which is somewhat simpler than 
the “ideal” and which probably reflects adequate accuracy for tele- 
phone rating purposes. This system, derived from the computational 
procedure, is designated the EARS (for Hlectro-Acoustic Rating 
System), and is dealt with in ensuing sections.* A graphical compu- 
tation procedure is also described. Finally, a comparison is made 
between ratings based on the EARS and ratings obtained from sub- 
jective tests. . 


The EARS utilizes an artificial voice and a 6-cm* coupler while the 
computational procedure utilizes the concept of orthotelephonic trans- 
mission. Response definitions appropriate to artificial mouth and 6-cm° 
coupler measurements are shown on Fig. 16 in terms similar to the 
orthotelephonic response definitions of Fig. 3. 

Typical artificial voice/6-cm* coupler amplitude responses of con- 
nections employing specific telephone set types are given in Ref. 18. 
As with orthotelephonic responses, artificial voice/6-em* coupler re- 
sponses vary widely depending on telephone set and transducer types 
used. Conversion curves for translating between orthotelephonic and 
artificial voice/6-cm* coupler responses are not as variable, but are still 
highly dependent on telephone set and transducer type. 


* Reference 51 describes derivation of a measuring system which is similar 


to the EARS. 
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TELEPHONE LINE TELEPHONE 
TRANSMITTER RECEIVER 


TRANSMITTER LINE RECEIVER 
RESPONSE = Tay RESPONSE = L RESPONSE = Reécm3 


Vi - 2 a 
z axl. = 20 LOG — = 20 LOGj— 
20 LOG Pr, Vi V2 


P,,= PRESSURE AT LIP RING OF ARTIFICIAL MOUTH, TRANSMITTER ABSENT GENERAL NOTE: 


SEE TEXT FOR 
Vi = OPEN CIRCUIT VOLTAGE, IN VOLTS, OF TRANSMITTER LOCATED AT DISCUSSION OF 


GAUGED POSITION RELATIVE TO THE ARTIFICIAL MOUTH LIP RING ARTIFICIAL MOUTH, 
P,.= RE DEVELOPED BY TELEPHONE RECEIVER IN A 6cm? COUPLER INCLUDING 
agr PRESS BVELO SPECIFICATION OF 
V2 = VOLTAGE ACROSS THE RECEIVER LIP RING PRESSURE. 


Fig. 16—Telephone circuit responses in artificial voice — 6-cm3 coupler terms. 


4.1 Derivation of KARS 

We begin by considering the computation form of Fig. 14. For the 
reference effective spectrum (column 2), we compute the cumulative 
contribution to loudness as a function of computation band, 1.e., we 
assume an orthotelephonically flat transmission system with 0 dB 
of gain for the sum of columns 38, 4, and 6, and compute the entries 
for column 10. Cumulating these and dividing each cumulative entry 
by the total loudness (N,) results in a relation between cumulative 
percentage contribution to loudness and frequency. This relation is 
shown by the small circles plotted on Fig. 17; the ordinate is cumula- 
tive and the abscissa is upper frequency limit of the computation 
band, obtained from Fig. 5. The relation of Fig. 17 differs from that 
of Fig. 5 because the latter is for a flat effective spectrum (Z; = a 
constant) whereas the former is for the effective spectrum of speech 
(Z; = Bo) — X) shown by the dot-dash curve of Fig. 9. 

The points plotted on Fig. 17 can be reasonably approximated by 
a straight line drawn from 100 Hz to 5000 Hz on the logarithmic 
frequency scale. Thus, 2-percent loudness bands, or any bands of 
equal loudness derived from this curve, have a logarithmic relation 
to frequency and may be found by laying off equal lengths along the 
logarithmic frequency scale. 

In terms of a measuring system, the above suggests using a source 
signal which has a flat amplitude versus frequency characteristic but 
which sweeps across the band at a logarithmic rate. Such a signal 
would cover equal loudness bands of the effective spectrum (Boy — X) 
in equal time divisions, corresponding to equal length divisions along 
the abscissa of Fig. 17. Integration of the sweep on a time basis 
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Fig. 17—Cumulative contribution to loudness of speech for the reference effec- . 
tive spectrum (see Figs. 9 and 14). 


would result in a value proportional to the total loudness of the effec- 
tive spectrum since the signal amplitude is constant with frequency. 

In essence, the operation described above translates the amplitude 
weighting of Z, = Bo) — X to frequency weighting. The computation 
form of Fig. 14 could be modified to reflect this by changing columns 
1 and 9 to conform with the straight line of Fig. 17 and entering a 
constant value of Z, in column 2. The value of Z, can be determined 
by dividing the computed NV, for By) — X by the number of computa- 
tion bands (e.g., 50) to find n, (the loudness per band of unit im- 
portance), then entering the curve of Fig. 6 at that value of n,. 

The source signal described above, in combination with the acoustic- 
to-acoustic response of a telephone connection, provides the effective 
spectrum of the received speech for that connection. The next step, 
then, is to apply the loudness scale of Fig. 6 in order to convert to 
loudness units. 

The loudness scale of Fig. 6 is linear on logarithmic coordinates, 
i.e., Z; = k log n,, above the knee of the curve, and the number of 
loudness units increases tenfold for a 44-dB change in effective level, 
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or doubles for a 18.2-dB change in effective level. (This is reasonably 
checked by subjective loudness tests with filters as plotted on Fig. 10. 
In the Steinberg tests, a 9.1-dB reduction in effective level corresponded 
to a 50-percent reduction in loudness, while in Van Wynen tests the 
level reduction required to produce half loudness was 12.4 dB). This 
relation is assumed to hold over the entire range of effective levels. 
The effect of this assumption on accuracy will be considered in a 
later section. 

At the receiving end of a telephone connection activated by the 
logarithmic signal source at the transmitting end, the received acoustic 
signal is applied to a measuring circuit and ultimately appears as a 
voltage across a resistance. At any instant, this voltage is proportional 
to 10°” where C is the response, in dB, of the telephone connection at a 
particular frequency. However, because of the sweep rate of the signal 
source, voltage elements appearing across the resistance in time sequence 
are proportional to 107*’”°, where Z, is the effective level of the received 
speech spectrum. We have already postulated that loudness doubles for 
a 13.2-dB change in Z, ; the voltage elements we would like to see 
across the resistance should therefore be proportional to 107*’**.* Thus, 
the desired voltage elements, V;, are proportional to the 2.2 root of 
the voltage elements, V,,, which actually appear, 1.e., 


V, «22 SV, (9) 
or 
Vi)" « V;,,. (10) 


The above relationship suggests that voltage elements proportional 
to 107*’*° be applied to the input of a 2.2-to-1 compressor. With. such 
a device, changes at the input proportional to 


10°" 1—Zs2/20) 


appear at the output proportional to 


10°7::-4s et 


* Tf loudness N2 is twice as large as loudness Nj, 


N» => 10648272 1/2) = 2 
N, i 
42 — 451 >= % logio 2 = 13.2, 
- and 
x = 44, 
Thus 


N « 1022/44, 
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which is the 2.2 root of the input change. Thus, the action of the com- 
pressor approximates the loudness scale of Fig. 6 in converting elements 
of the received effective spectrum to loudness units. 

Integration of the voltage with respect to time may be either on a 
square-law or linear basis. Without the compressor, square-law integra- 
tion adds voltage elements proportional to 107*’*° while linear integra- 
tion adds voltage elements proportional to 107’*°. By including the 
compressor with linear integration, elements proportional to 107*’™* are 
added, these in turn being proportional to loudness units. The com- 
bination of the two gives, in effect, 2.2 root law addition. 

System operation to this point corresponds to use of the computa- 
tion form of Fig. 14 in that the integrated voltage, V; , is proportional 
to N,. A means of expressing this voltage in dB is needed for the 
same reasons that the curve of Fig. 18 was derived to permit inter- 
pretation of N,. This can be accomplished using an indicating meter 
with a dB scale. By making the meter circuit an “averaging” one 
with a time constant long compared to the sweep time of the source 
signal, deflections of the needle will be approximately proportional to 
the total number of loudness units. A dB scale corresponding to the 
conversion specified by the curve of Fig. 13 could be provided. This 
might be difficult since the constant of proportionality of the needle 
deflections to total number of loudness units is not readily obtainable. 
Instead, advantage was taken of another approximation. 

Comparison of the curves of Figs. 6 and 13 shows that above their 
respective knees the shapes of these curves are the same. If we assume 
a straight-line relationship between L, and N, as was done earlier for 
the relation between n, and Z,, the addition of distortionless attenu- 
ation in any speech spectrum will produce a dB loudness level change 
equal to the change in attenuation. Thus, the meter scale can be so 
calibrated that it obeys the same law of addition as that provided 
by the compressor and linear integrator in converting elements of 
effective level to loudness, namely a 13.2-dB change in effective level 
would correspond to doubling the total number of loudness units. The 
scale on the meter would show a difference of 13.2 dB between half- 
scale and full-scale deflection of the needle, or between any two de- 
flection points, the greater of which is twice the smaller. Conse- 
quently, the meter reading would reflect a change in distortionless 
loss in the telephone connection under test, dB for dB. 
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4.2 Description of KARS 

A block diagram of EARS is shown on Fig. 18. The system con- 
sists of two parts: a signal source and a measuring subsystem. The 
latter includes provision for measuring both voltage and pressure.’ 


4.2.1 Signal Source 

The logarithmic oscillator provides at its output terminals a signal 
which sweeps logarithmically with time from 300 Hz to 3300 Hz to 
300 Hz and has a flat amplitude versus frequency characteristic. The 
reason for limiting the measuring band of the EARS to 300-3300 Hz 
is a practical one. The use of partial connection ratings as an engi- 
neering tool implicitly requires that, for any given connection, the 
sum of the partial ratings should approximately equal the overall 
rating. Thus, the bandwidth used to obtain these ratings should ap- 
proximate the bandwidth of the most restrictive element(s) in the 
connection in order to avoid cumulating bandwidth penalties when 
summing partial ratings. The specific limits of 300 Hz and 3300 Hz 
were selected by reviewing bandwidth characteristics of various 
telephone equipments and facilities. 

The oscillator sweep rate is 6 times per second where a sweep is 
defined in terms of the 300-3300-Hz band. Criteria leading to selec- 
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CONNECTION ADJUST MEASUREMENT 


COMPRESSOR 
(B) MEASURING SUBSYSTEM 


Fig. 18—Block diagram of the EARS. 
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tion of this sweep rate were (2) the sweep interval (1/sweep rate) 
should be small as compared to an easily realizable integrating time 
constant for the indicating meter and (2) the rate should approximate 
the syllabic rate of speech. The latter characteristic appears desir- 
able when measuring carbon transmitters in order to ensure that 
these operated at an efficiency comparable to that obtained under 
actual use conditions, i.e., with real speech applied. (Recent measure- 
ments indicate that the sweep rate can be changed over the range 
2 to 10 sweeps per second without significantly changing the ratings 
of many telephone sets of modern design.) . 

The conditioning network (a 6-dB attenuator) is momentarily 
switched out of the source circuit prior to measuring carbon trans- 
mitters thus increasing the source signal level by 6 dB. The higher 
level is intended to condition the transmitters to operate at the proper 
level. Conditioning is not required when measuring linear transmitters. 

The artificial mouth equalizer comprises a passive network whose 
frequency response is the inverse of the electric-to-acoustic response 
of the artificial mouth. The loss of this network is compensated for 
by the amplifier. 

The artificial mouth used is a permanent magnet, moving coil loud- 
speaking unit, and is, for all practical purposes, the equivalent of an 
earlier proposal.*? The mouth includes, as an integral part, a lip ring 
which is used as a reference for obtaining the proper spatial relation- 
ship between the artificial mouth and telephone instruments under 
test. The location of the lip ring has been empirically determined so 
as to correspond approximately to the plane of the lips of a human 
mouth.°*? 

Ideally, the source arrangement should, with an oscillator output 
which is flat with frequency over the band of interest, provide an 
output pressure at the artificial mouth lip ring which is also flat with 
frequency.” Practically, control of the overall response to within 
+1 dB of the 1000-Hz value provides acceptable operation, intro- 
ducing less than about 0.2 dB error in ratings. 


4.2.2 Measuring Subsystem 
The measuring subsystem is arranged to permit both voltage and 
pressure measurements. For voltage measurements, the input is a 


* The pressure 1s measured with a Type L microphone.®? The microphone is 
located in a carefully gauged position so selected that the pressure measured at 
that point corresponds closely to the pressure at the center of the lip ring 
opening.54 
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high-impedance transformer, and is bridged across the selected im- 
pedance (usually a 900-ohm resistor) terminating the telephone con- 
nection at the point where voltage measurement is desired. The trans- 
former is connected to an attenuator, used in system calibration to 
compensate for gain drift of amplifiers, thence to a switch node. 


The pressure measuring circuit consists of a 6-cm* test coupler 
equipped with a Type IL pressure microphone.* The microphone is con- 
nected to a condenser microphone amplifier which provides bias voltage 
to the microphone and which produces at its output a voltage propor- 
tional to the pressure developed in the 6-cm* cavity by the telephone 
receiver under test. The amplifier is connected through a calibrating 
attenuator to a switch node. 

The measurement mode is selected by operating the switch to 
“pressure measurement” or ‘voltage measurement.” The switch swinger 
connects to an attenuator (designated Rating Control), amplifier, 
compressor, and vacuum tube voltmeter. The compressor design em- 
ployed provides a 2.2-to-1 characteristic over a limited range. Opera- 
tion is confined to this range by holding the compressor output voltage 
at a constant value, indicated on the vacuum tube voltmeter as the 
“reference” or “zero” point, and adjusting for rating changes using 
the Rating Control. This takes advantage of the fact that, as pre- 
viously noted, flat attenuation changes ahead of the compressor equate, 
on a dB-for-dB basis, to loudness level changes. 


4.2.3 System Calibration 

System calibration consists of first removing the condenser micro- 
phone from the coupler and locating it at gauged position relative to 
the artificial mouth lip ring, then adjusting the source to deliver 
reference test pressure at that point. This adjustment is based on the 
condenser microphone calibration, and does not involve the EARS 
measuring subsystem. When the proper test pressure has been ob- 
tained, the EARS measuring system is switched to the pressure mea- 
suring mode, Rating Control is set at the “0” dB, and the “Pressure 
Adjust Attenuator” set to obtain reference reading on the vacuum 
tube voltmeter. Since the pressure spectrum being measured is flat 
with frequency, pressure level equates to loudness pressure level. 

Calibration of the voltage measuring mode proceeds in a similar 


* The simple 6-cm? test. coupler used, conforming in design to present standards 
(Fig. 3 of Ref. 55), has been found suitable for measurements of the type con- 
sidered in this report. However, handsets with ear caps of unusual shape may 
require a different coupler configuration. 
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manner. A voltage signal, derived from the log oscillator, is applied 
to a 900-ohm resistor. The voltage developed is first read directly with 
a voltmeter. Then the high-impedance bridging connection of the 
EARS measuring subsystem is connected across the resistor, the sys- 
tem switched to the voltage measuring mode, and the Rating Control 
set to read the voltage measured with the voltmeter above, i.e., if 
the latter was —2 dB relative to 1 volt, then the rating control is set 
to read —2 dB relative to the “reference” or ‘‘zero” setting. The 
“Voltage Zero Adjust” is then set to obtain “reference” or ‘zero” 
reading on the measuring subsystem vacuum tube voltmeter. Since 
the voltage spectrum being measured is flat with frequency, voltage 
level equates to loudness voltage level. 


4.2.4 Rating Measurements 


Ratings of partial or overall telephone connections are established 
by (t) the reading of the Rating Control and (7) the reference pres- 
sure and/or voltage levels employed. Examples of rating measure- 
ments, including signal levels employed with the present EARS, are 
given on Figs. 19, 20, and 21. The transmitting and receiving loops 
of Figs. 19 and 20 respectively are the same as those for the overall 
connection of Fig. 21. Sum of the component ratings indicates a 
loudness loss of 17.8 dB (= —18.8 + 25.6 + 10) while the overall 
rating from Fig. 16 is 17.1, a discrepancy of 0.2 dB. Had the mea- 
surement band been increased from 300-3300 Hz to, for example, 100- 
5000 Hz, the discrepancy would have been somewhat larger because of 
the bandpass response of both the transmitter and receiver. 
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Fig. 19—Transmitting loop rating. 
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NOTES: 


(1) SIGNAL SOURCE PROVIDES AN OUTPUT SIGNAL WHICH CONSISTS OF THE TEST SPECTRUM 
(FLAT WITH FREQUENCY) MODIFIED BY THE FREQUENCY RESPONSE OF COMPONENTS WHICH 
NORMALLY PRECEDE THE RECEIVING COMPONENT IN AN OVERALL CONNECTION. THIS IS 
NECESSARY IN ORDER TO PRESERVE ADDITIVITY OF COMPONENT RATINGS. 


(2) RATING CONTROL SETTING RELATIVE TO REFERENCE SETTING IN ORDER THAT THE INDICATING 
METER NEEDLE IS AT REFERENCE FOR THE SELECTED TEST VOLTAGE. THE LOUDNESS TEST 
VOLTAGE IS THUS —40 dB RELATIVE TO 1 VOLT. 


(3) SEE REF. 3 FOR RATING DEFINITIONS. THE TEST REFERENCE VOLTAGE IS SO SELECTED 
THAT IT IS NUMERICALLY EQUAL, IN dBre 1 VOLT, TO THE SELECTED TEST PRESSURE 
LEVEL IN dBt (SEE FIG. 19.) FOR ANY OTHER TEST VOLTAGE, THE EQUATION FOR RLR MUST 
CONTAIN A CONSTANT. 


Fig. 20—Receiving loop rating. 


4.3 Graphical Computation of Loudness 


A form for graphical determination of loudness ratings from the 
amplitude response characteristics of telephone connections is shown 
on Fig. 22. This form is based on the same considerations as those 
leading to the EARS, and is discussed in this paper because it is the 
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Fig. 21—Overall rating. 
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EARS LOUDNESS COMPUTATION 


TITLE—CIRCUIT DESCRIPTION AREA SQ. IN. 
BASE IN. 
HEIGHT——— IN. 
RATING—— dB 


INCHES 


a 
ae) 
z= 
n 
n 
oO 
pa 


0.2 0.4 0.60.8 1 
FREQUENCY IN kHz 





Fig. 22—Graph paper for computing loudness ratings. 


vehicle used to study the effects of differences between the loudness 
computation method and the EARS. 

The lower abscissa scale is frequency in Hz, corresponding on a 
logarithmic basis to the upper abscissa scale in inches. (Use of inches 
is arbitrary; any length units could be used.) Thus, equal increments 
on the upper scale correspond to equal increments of log fo/f,; where 
fe > fi. (This reflects the straight-line approximation shown on Fig. 
17). The equal distance increments closely approximate bands of the 
By) — X spectrum which arc interpreted to be of equal loudness when 
listened to by the human ear. 

The right-hand ordinate scale is proportional to the 2.2 root of a 
voltage, current, or pressure. This scale and grid are constructed 
according to the equation 


44 log 2X = 55 —L (11) 
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where 


X = inches (left-hand ordinate scale) 
L = loss of circuit in dB (right-hand ordinate scale). 


(Equation (11) assumes that the reference input to the connection 
is flat with frequency.) The ordinate is measured in inches from the 
bottom line of the graph which corresponds to zero output, voltage, 
current, or pressure. Correspondence between ordinate scales is dem- 
onstrated in Table XIII. Boxes at the top of the sheet are provided 
for recording graph measurements. 

In order to use the graph paper, the loss (in dB) of the partial 
or overall telephone connection must be known over the band of 
interest. This loss data may be in terms of 








input pressure input voltage input pressure (millibars) 
output pressure’ output voltage’ output voltage (volts) 
or 


input voltage (volts) 
output pressure (millibars) 


The loss frequency characteristic is plotted on the graph paper. 
The right-hand ordinate scale may be adjusted by a constant for 
negative losses, i.e., gains. Where large losses are encountered, greater 
accuracy can be obtained by similarly applying an adjustment con- 
stant. To illustrate, if the lowest loss across the band of interest is 
—15 dB, the values along the right-hand ordinate scale should have 
15 subtracted from them. If, on the other hand, the lowest loss is +15 
dB, the right-hand ordinate values should be increased by 15. 

The average height of the response as plotted on the graph paper 
is determined by measuring the area (in square inches) under the 
curve and dividing by the base width (in inches). The average height 
is then located on the left-hand scale and the corresponding dB value 
read from the right-hand scale. This dB value is the rating based on 
reference input level. 

For cases where the input spectrum is not flat with frequency, the 
graphical computation of a circuit rating involves determining two 
areas. The first of these is obtained from a plot of the actual input 
spectrum on the graph paper, the second from a plot of the input 


*See Ref. 3. 
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TABLE XIII— Retatrion BETWEEN ORDINATE DISTANCE AND Loss 














SCALES 
Left-Hand Ordinate Right-Hand Ordinate 
Scale-Distance (Inches) Input/Output Scale-Loss (dB) 
pe aes ; : ore are : eae aE 
0.5 561 55 
5 3.55 1 
8.89 1 0 


spectrum modified by the circuit response. The loudness rating is 
then the dB difference between the losses computed for these areas. 


V. COMPARISON OF GRAPHICALLY DETERMINED RATINGS AND OBSERVED 
RESULTS 


The speech loudness computation method accurately predicts loud- 
ness performance of telephone connections (see Section 3.5). Labora- 
tory realization of this method, the EARS, involved a number of 
simplifying assumptions. Thus, validation of the EARS approach re- 
quires considering (7) the accuracy of the EARS in predicting loud- 
ness performance and (7) the effects of the various simplifying as- 
sumptions. We do this by comparing computed ratings and observed 
results for the subjective tests of Section 3.5. (Numerous other tests 
have been performed which support the EARS concept, but these are 
not considered because either the tests were limited, i.e., incomplete 
test designs, small observer groups, etc., or the test system amplitude 
response characteristics are not known.) 


TABLE XIV—IFEATURES OF THE SPEECH LOUDNESS COMPUTATION 
MeEruHop AND THE EARS Metuop 

















Computation EARS 
Feature Method Method 
Loudness Law Figs. 6 and 8 Linear Portions of 
Figs. 6 and 8 
Spread-of-Loudness 
Correction Yes No 
Analysis Bandwidth 100-5000 Hz 300-3300 Hz 
Reference Bandwidth 250-4000 Hz 300-3300 Hz 
Rating Definition Orthotelephonic Artificial Voice/6-cm? 
Terms Coupler Terms 


(Fig. 3) (Fig. 16) 
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Validation of the EARS utilized the graphical computation method 
described in Section 4.3. This approach was necessary because the 
systems used in the tests were not available for direct measurement 
using the EARS. However, the frequency response characteristics of 
these systems were known and, thus, their loudness ratings could be 
computed. 

In theory, the graphical form (Fig. 22) and the computation form 
(Fig. 14) should provide about the same results. The reference of the 
former (the horizontal line at 0 dB of loss) corresponds to that of 
the latter (entries of column 2) because of the relation between fre- 
quency weightings of the two references. There is, however, one 
important difference between the two forms. For the graphical form, 
the loudness versus effective level relation has a constant slope 
(straight-line portion of Fig. 6) while for the computation form, this 
relation has a pronounced change in slope at low effective levels (see 
Fig. 6). The effects of this difference should be most apparent at low 
received speech levels where the graphical method would indicate a 
higher received speech loudness than would the computational method. 

With the above in mind, we can now consider the ratings for the 
various tests under a number of different conditions. These conditions, 
listed in Table XIV, reflect the differences in features of the speech 
loudness computation method and the EARS. 

Computed and observed ratings are given in Tables XV through 
XIX in terms of loudness loss (positive entries) or loudness gain 
(negative entries). The arrows and associated numbers below the 
tabular entries refer to distributions of differences between the various 
conditions considered. | 

We will consider Table XV in some detail, noting that since the 
other tables have an identical arrangement, such discussion will in 
general similarly apply to these tables. Referring to Table XV, 
column 1 designates the network tested, and is repeated from Table 
V. Column 2 designates the setting of the reference network for which 
the test observers judged the test and reference networks to provide 
equally loud speech. Columns 3 through 9 each contain 3 subcolumns; 


“a” and “b” are the loudness ratings of the test and reference chan- 


nels respectively and “‘c” represents the error, equal to “a” entries 
minus “b” entries. 

Column 3 entries are essentially a repeat of the information con- 
tained in Table V, obtained by converting the Table V entries to L, 
values, then subtracting these from 89.4 dBt, the level of the refer- 


ence speech spectrum of Fig. 2 transmitted over a system that is flat 


TaBLe XV—Compvutep LoupxNess Losses (DB) FoR STEINBERG LINEAR SystEM Trsrs (1924) 


(See also Table V) 
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1 2 3 4 5 6 7 8 9 
a b ¢ a b c a b a b ¢ a b a b a b € 
: GRAPHICAL COMPUTATION (Fig. 22) 
Orthotelephonic Responses Artificial Voice/6-cm? Coupler Responses 
Same as Computation Form Except 
Analysis Band = Analysis Band = Analysis Band = Analysis Band = 
Observed Computation Form Linear Loudness Law, Reference Band = Reference Band = Reference Band = Reference Band = 
Ref. Ckt. (Fig. 14) Linear Loudness Law | No Spread-of-Loudness 100-5000 Hz 300-3300 IIz 100-5000 Hz 300-3300 Hz 
Settin 
Test System (dB)” Test Ref. A Test Ref. A Test Ref. A Test Ref. A Test Ref. A Test Ref. A Test Ref. A 
125 Hz HPF —22.0 -17.5 -17.1 -0.4] -—17.38  -17.7) 0.4} —18.9 -19.3 0.4) -12.3 -12.8 0.5} -15.4 -14.8 -0.6] -—4.0 -4.1 0.1] -4.5 -3.9 -0.6 
250 Hz HPF —20.6 -15.0 -15.7 0.7 | —14.5 -16.3 1.8} -15.9 ~-17.9 -2.0 -9.3 -11.4 2.1} -153 —-13.4 -1.9 0.6 -2.7 3.3] —-4.4 ~-2.5 -1.9 
375 Hz HPF —19.0 —-12.7 -—14.1 1.4} -12.5 -14.7 2.2] -13.7 -16.3 2.6 -6.7 -9.8 3.1] -18.4 ~—13.0 —0.4 2.6 -1.1 3.7] -—2.7 -0.9 -1.8 
500 Hz HPF —-17.6 ‘-10.7 —12.7 2.0] —-10.5 -13.38 2.8] —11.2 -14.9 3.7 —4.6 -8.4 3.8] -10.5 —-10.6 0.1 4.8 0.3 4.5) -O.1 0.5 —-0.6 
625 Hz HPF —15.0 —-8.7 —10.1 1.4 —-8.8 -10.7 1.9 9.4 —12.3 2.9 —2.6 —5.8 3.2 3 —-8.0 —0.3 6.1 2.9 3.2 1.6 3.1 -1.5 
750 Hz HPF -14.1 -6.9 —-9.2 2.3} -6.8 -9.8 3.0 4 -11.4 4.0] -0.8 -4.9 4.1 5.8) 7.1 2.3 71 3.8 3.3 3.0 4.0 —1.0 
1000 Hz HPF —}2.1 —3.9 —-7.2 3.3 —4.7 —7.8 3.1 0 —9.4 5.4 2.6 2.9 5.5 > 5.1 3.6 10.4 5.8 4.6 7.4 6.0 1.4 
1250 Hz HPF -9.4 1.5 —4.5 3.0 -1.5 -—5.1 3.6 3 —6.7 6.4 6.2 —-0.2 6.4 3.5 —2.4 5.9 13.2 8.5 4.7 11.2 8.7 2.5 
1500 Hz HPF —6.0 0.8 —1.1 1.9 1.7 -1.7 3.4 3.2 —3.3 6.5 9.8 3.2 6.6 8 1.0 7.8 16.1 11.9 4.2 15.0 12.1 2.9 
750 Hz LPF 12.7 -—5.1  —7.8 2.7 -4.8 -8.4 3.6 3.6 0 6.4 3.0 -3.5 6.5 5.0 —-5.7 10.7] 11.2 5.2 6.0] 18.7 5.4 13.3 
1000 Hz LPF -15.2 -8.3 -10.3 2.0} -7.3 -10.9 3.6 wl oO 5.4) -0.3 -6.0 5.7] -0.6  -—8.2 7.6 8.7 2.7 6.0] 12.1 2.9 9.2 
1250 Hz LPF —16.2 -—9.7 -11.3 16] -9.5 -H.9 2.4 9.5 5 4.0] -2.8 -7.0 4.2} -4.5 —9.2 4.7 7.7 1.7 6.0] 11.0 1.9 9.1 
1500 Hz LPF —18.2 -11.2 -13.3 2.1} -11.1 -13.9 2.8 3 as) 4.2 4.7 —9.0 4.3 —7.2 —11.2 4.0 4.7 —-0.3 5.0 5.7 -O.1 5.8 
2000 Hz LPF —19.6 —13.4 -14.7 1.3) -13.1 -15.3 0 2.2 9 9 3.0] -7.3 -10.4 3.1] -10.8 12.6 1.8 2.6 -1.7 4.3 2.0 -1.5 3.5 
2500 Hz LPF —20.2 -15.2 -15.3 0.1 | -14.3 -15.9 1.6 5.70 17.5 1.8 —9.1 —11.0 1.9] -—13.3  —13.2 -0.1 0.2 —-2.3 2.5] -1.5 -2.1 0.6 
3000 Hz LPF —20.7 —16.2 —15.8 -0.4] -15.3 -16.4 1.1 | -16.4 —18.0 1.6] -9.9 -11.5 1.6] —14.3 -13.7 -0.6] —0.8 -2.8 2.0] —3.0 -2.6 -0.4 
4000 Hz LPF —21.9 -17.38 -17.0 —-0.3] -16.8 —-17.6 0.8] —18.1 —19.2 Ld] -11.5 -12.7) 1.2] -15.5 -14.9 -0.6] -3.8 -4.0 0.2] —-4.2 -3.8 -0.4 
Error: Average 1.4 2.4 3.6 3.7 2.5 3.8 2.4 
o 1.1 1.0 1.9 1.9 3.6 1.7 4.3 
Difference 0.2, 0.5 -0.5, 1.0 6.6 —3.3, 1.8 —1.0, 3.1 
Distributions Ea > a > —~C <—__———__> 
8.7, 1.0 
<< 
+11.0, 2.1 
‘ —$—$<$<—<$—< > 
125 Hz HPF 38.4 47.0 46.8 0.2 7 42.7 1.0 42.1 4i.1 1.0 48.7 47.6 1.1 45.4 0.2 57.0 56.30 0.7 56.5 56.5 0 
250 Hz HPF 38.8 47.8 47.2 0.6 ‘0 43.10 2.4] 45-1 0 46.5 0 3.6] 51-7 48.0. 3.7 45.8 —0.1] 61.6 56.7 4.9| 56.6° 56.9 —0.3 
375 Hz HPF 38.2 48.3 46.6 1.7 “0 42.5 6.0 47.3 40.9 6.4 14.3 47.4 6.9 45.2 2.4 63.6 56.1 7.5 58.3 56.3 2.0 
500 Hz HPF 39.4 51.6 47.8 3.8 a) 43.7 6.8 49.8 42.1 7.7 56.4 48.6 7.8 46.4 4.1 65.8 57.3 8.5 60.4 57.5 2.9 
625 Hz HPF 41.4 52.6 49.8 2.8 2 45.7 6.5 51.6 44.1 7.5 58.4 50.6 7.8 48.4 4.3] 67.1 59.3 7.8] 62.6 59.5 3.1 
750 Hz HPF 39.0 54.2 47.4 6.8 2 43.3 10.9 53.6 41.7 11.9 60.2 48.2 12.0 46.0 9.2 68.1 56.9 11.2 64.0 57.1 6.9 
1000 Hiz HPF 41.6 56.6 50.0 6.6 3 45.9 10.4 57.0 44.3 12.7 63.6 50.8 12.8 48.6 10.9 71.4 59.5 11.9 68.4 59.7 8.7 
1250 Hz HPF 429 58.1 3 6.8 (5 47.2 12:3] 60.7 45.6 15.1] 67.2 S21 151 49.9 14.6| 74.2 60.8 13.4] 72:2 61.0 11.2 
1500 Hz HPF 43.7 59.8 A 7.7 2.7 48.0 14.7 64.2 46.4 17.8 70.8 9 17.9 50.7 19.1 77.1 61.6 15.5 76.0 61.8 14.2 
750 Hz LPF 48.1 61.2 5 4.7 2 52.4 3.8 57.4 50.8 6.6 64.0 3 6.7 55.1 10.9 | 72.2 0 6.2] 79.7 66.2 13.5 
1000 Hz LPF 45.7 58.4 | 4.3 7 50.0 3.7 43.9 48.4 5.5 60.7 5.8 52.7 7.7 69.7 6 6.1 73.1 63.8 9.3 
1250 Hz LPF 43.7 56.8 A 4.7 oO 48.0 3.5 515 46.4 5.1 58.2 3 50.7 5.8 68.7 6 7.1 72.0 61.8 10.2 
1500 Hz LPF 42.1 55.4 a) 4.9 9.9 46.4 3.5 49.7 44.8 4.9 56.3 0 49.1 4.7 65.7 0 5.7 66.7 60.2 6.5 
2000 Hz LPF 39.2 53.4 6 5.8 9 43.5 4.4 47.1 41.9 5.2 a3.7 5.3 46.2 4.9 63.6 al 6.5 63.0 57.3 5.7 
2500 Hz LPF 39.8 SLE .2 2.9 7 44.1 2.6 45.3 42.5 2.8 5L.9 2.9 46.8 0.9 61.2 7 3.5 59.5 57.9 1.6 
3000 Hz LPF 38.6 49.8 0 2.8 3.7 42.9 2.8 44.6 41.3 3.3 51.1 3.3 45.6 1.1 60.2 ) 3.7 58.0 56.7 1.3 
4000 Hz LPF 37.9 47.2 3 0.9 <2 42.2 2.0 42.9 40.6 2.3 4955 2.4 44.9 0.6 57.2 8 1.4 56.8 56.0 0.8 
Error: Average 4.0 5.7 0 7.8 6.0 7.4 5.7 
o 2.2 3.9 4.6 4.6 5.3 3.9 4.6 
Difference —2.5, 2.5 —0.3, 1.0 6.6 —3.4, 2.2 —1.1, 3.3 
Distributions ——— OO a > xX —_— 


8.7, 1.0 
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TABLE X VI—ComputrEep LoupNEss LossEs (DB) For VAN Wynen Linzar Sysrem Tzsrs (See also Table VI) 





















































1 2 3 4 | 5 | 6 | ig 8 | 9 
a b ¢ a b c a b c a b é a b ¢ a b c a b c 
GRAPHICAL COMPUTATION (Fig. 22) 
Orthotelephonic Rexponses Artificial Voice/6-cm! Coupler Responses 
Same as Computation Form Except 7 ; 
Analysis Band = Analysis Band = Analysis Band = Analysis Band = 
Observed Computation Form Linear Loudness Linear Loudness Law, Reference Band = Reference Band = Reference Band = Reference Band = 
Ref. Ckt. (Fig. 14) Law No Spread-of-Loudness 100-5000 Hz : 300-3300 Hz 100-5000 Hz 300-3300 Hz 
Setting — 

Test System (dB) Test Ref. A Test Ref. A Test Ref. A Test Ref. A Test. Ref. A Test Ref. A Test Ref. A 
2700 Hz LPF 24.2 8.0 8.6 -0.6 7.9 8.9 —-1.0 6.9 7.6 —0.7 14.30 -O0.9 | 10.6 12.5  -1.9 | 21.1 21.6 -O.5 | 20.2 21.7 -1.5 
1750 Hz, LPF 28.3 11.1 12.8 -1.7 10.9 13.0 —2.1 9.9 W.7 1.8 5 18.4 -1.9 | 15.0 16.6 -1.6 | 24.6 25.7 -1.1 | 25.9 25.8 0.1 
1000 Hz LPF 31.3 15.3 16.0 —-0.7 16.3 16.0 0.3 15.9 14.7 1.2 {22.5 21.4 1.1 23.8 19.6 4.2 | 29.7 28.7 1.0 | 34.2) 28.8 5.4 

250 Hz HPF 25.6 10.3. 10.2 0.1 10.3 10.3 0 9.1 9.0 0.1) 15.7 15.7 0 9.3 13.9 -4.6 | 25.5 23.0 2.5 119.4 23.1 -3.7 
550 H2 WPF 30.8 16.3 15.4 0.9 14.9 15.5 —0.6 14.0 14.2. —0.2] 20.7 20.9 -O.2 | 14.1 19.1 -—5.0 | 29.2 28.2 1.0 | 22.8 28.3 —5.5 
1000 Hz HPF 36.4 21.8 21.3 0.5 21.2 21,1 0.1 22.0 19.8 3.2 5 26.45 2.0 | 23.3 24.7  -1.4 | 36.0 33.8 2.2 [31.2 33.9 -2.7 
250-2700 Hz BPF 29.0 14.4 13.4 1.0 14.5 13.7 0.8 13.6 12.4 1.2 401901 1.3 | 11.9 17.38 —5.4 | 30.0 26.4 3.6 | 21.4 26.5 9 -5.1 
550-1750 Hz BPF 38.9 26.9 23.8 Bae | 26.8 23.6 3.2 20.5 22.3 1.8 ¥. 2.0 4.1 | 23.8 27.2 -3.4 | 43.0 36.3 6.7 | 33.6 36.4 —-2.8 
1000 Hz Peak 
15 dB 26.8 9.2 11.3 —2.1 9.7 11.5 ~1.8 8.0 10.20 —2.2] 14.7 16.9 —-2.2 5. —4.1 ) 22.5 24.2 -1.7 | 20.3 24.3 -—4.0 
30 dB 24.2 i ae 8.6 -1.1 7.5 8. —1.4 5.6 7 —2.0/ 12.4 14.3 -1-4¢ ‘ -5.5 | 20.8 21.6 —-O.8 | 16.2 21.7 —5.5 © 
56 dB 34.5 19.1 19.3 -0.2 19.0 19 —0.2 17.2 17.9 0.7 | 24.0 24.6 —0.6 2 -5.8 30.5 31.9 1.4 | 26.3 382.0 -5.7 
700 Hz Peak 
30 dB 25.8 8.0 10.4 —2.4 S.4 10.5 ~2.1 6.8 9.2 —-2.4/13.4 15.9 —2.5 8.5 14.1 -5.6 | 21.7 23.2 —-1.5 | 17.7 23.3 -5.6 
2000 Hz Peak 
30 dB 28.7 6.6 13.2 —6.6 7.3 13.4 -6.1 5.8 12.1 ~6.3) 12.3 18.8 —6.5 7.2. 17.0 -9.8 | 19.8 26.1 -—6.3 | 15.4 26.2 —-10.8 
Falling Loss 25.6 10.2. 10.2 0 10.8 10.3 0.5 9.4 9.0 0.41 16.0 15.7 0.3 | 12.5 13.9 -1.4 | 23.8 23.0 0.8 | 21.5 23.1 -1.6 
Rising Loss 26.4 8.8 10.9 —-2.1 9.3 11.1 ~-1.8 7.5 9.8 —-2.3] 14.1 16.5 -2.4 | 14.8 14.7 O.1 | 2b.4 23.8 ~-2.4 | 25.1 23.9 2 
Error: Average -0.8 —0.8 -0.6 —0.6 —3.5 0.1 -3.3 
a 2.1 2.0 2.6 2.4 3. 2.9 3.6 
Difference 0,0.5 —1.1,0.7 6.6 —4.4,3.0 ~—3.0,3.9 
Distributions ———-_ > — —_— i —__> 
8.1,0.9 
| 9.4,0.8 
! << 
i 
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TABLE XVII—ComputeEep LoupNEss LossEs (DB) FoR VAN WYNEN TELEPHONE Ser Tests (1940) 
(See also Table VII) 
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1 2 3 4 | 5 | 6 | 7 8 | 9 
a b © a b c a b c a b e a b c a b c a b c 
GRAPHICAL COMPUTATION (Fig. 22) 
Orthotelephonic Responses Artificial Voice/6-cm? Coupler Responses 
Same as Computation Form Except 
Test System Computation Linear Linear Loudness Analysis Band = Analysis Band = Analysis Band = Analysis Band = 
Observed Form Loudness Law, No Spread-of- Reference Band = Reference Band = Reference Band = Reference Band = 
as Ref. Ckt. (Fig. 14) Law Loudness 100-5000 iz 300-3300 Hz 100-5000 Hz 300-3300 Hz 
ele. Setting 
Set Test Network (dB) Test Ref. A Test Ref. A Test Ref. iA Test Ref. A Test Ref. A Test Ref. A Test Ref. A 
A 5dB-4000 Hz LPF 16.8 13.1 12.6 0.5 | 12.6 12.5 0.1} 12.4 10.5 1.9 18.5 17.1 1.4 10.5 10.5 0 26.2 23.3 2.9 18.7 18.0 0.7 
25dB-4000 Hz LPF 35.2 34.1 31.7 2.4 | 32.6 30.9 1.7 | 32.4 28.9 3.5 | 38.5 35.5 3.0 30.5 28.9 1.6 46.2 41.7 4.5 38.7 36.4 2.3 
B 5dB-4000 Hz LPF 11.4 6.7 7.2 -0.5] 6.7 7.1 -0.4/] 7.7 5.1 2.6 14.7 11.7 3.0 7.2 5.1 21 23.4 17.9 5.5 16.3 12.6 3.7 
25dB-4000 Hz LPF 31.5 27.2 27.7 —0.5) 26.7 27.2 ~0.5 | 27.7 25.2 2.5 | 34.7 31.8 2.9 27.2) 25.2 2.0 43.4 38.0 5.4 36.3 32.7 3.6 
5dB-Rising Loss 11.7 6.9 7.6 —0.7 6.5 69 —0.4 7.5 4.9 2.6 14.5 11.5 3.0 8.1 6.4 1.7 23.5 17.4 6.1 17.3 15.2 2.1 
25dB-Rising Loss 31.2 27.0 27.2 0.7 | 26.5 26.4 0.1 | 27.5 24.4 3.1 34.5 31.0 3.5 28.1 25.9 22 43.5 36.9 6.6 37.3 34.7 2.6 
Error: Average 0.3 0.1 27 2.8 1.6 5.2 2.5 
o 1.1 0.8 5 0.7 0.7 1.2 1.0 
Test Telephone -—1.0,0.5 —0.2 6.1 —8.0 -7.5 
<> Ss SS <—_\_—_—_>> 
Set A : 7.7 
2 
Fa ——— hah eae 
2S 8.2 
3 : OO 
2] Test Telephone —0.6,0.5 +1.0 7.0 —7.0,0.5 —6.6,0.5 
= —<—<——ucoer sir ix ——_— 
a Set B 8.8 
S << 
8 9.1 
5 ne 
| Reference —0.5,0.3 -—2.0 6.6 —6.1,0.7 —4.3,1.5 
= <> <> <> <_ 
= Telephone 6.1 
<> 
Set 7.9,0.6 
<—_— 
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TaBLeE XVIJJ—Compurrep LoupNnEss LossEs (DB) For VAN WYNEN Tie LINE SystEM TESTS 


(See also Table VIIT) 







































































1 4 | 5 | 6 8 9 
¢ b e a b a b c a b c a b € & b c 
. GRAPHICAL COMPUTATION (Fig. 22) 
Orthotelephonic Responses Artificial Voice /6-em Coupler Responses 
Same as Computation Form Except 
Computation Linear Linear Loudness Law, Analysis Band. = Analysis Band = Analysis Band = Analysis Band = 
Loudness No Spread-of- Reference Band = Reference Band = Reference Band = Reference Band = 
Law Loudness 100-5000 Hz 100-5000 Hz 300-3300 Hz 
Test System A Ref. A Test Ref. A Test Ref. A Test A Test Ref. A Test Ref. A 
Overall-DL Reference 
20-3000 LPF 2 2 6 —-0.4 3 14.6 —-0.3 13.4 13.4 0 19.9 20.0 —0.1 11. 12.5 -—0.9 —0.6 18.9 19.4 —0.6 
20-2400 LPF 5) A AS) 0.2 7 15.9 —0.2 15.1 14.7 0.4 | 21.7 21.3 0.4 13. 13.8 —0.4 0.6 21.6 20.8 0.8 
20-1900 LPF A al 9.1 1.0 6 18.1 —0.5 18.1 16.9 1.2 | 24.6 23.5 1.1 16. 16.0 0.7 2.5 25.6 23.0 2.6 
0 4 —0.3 19.4 -1.8 18.2. —0.1 24.8 -0.2 17.3 —0.6 1.2 24.3 1.5 
20-RL1 0 0 4 —0.4 15.4 0.1 13.9 14.2. —0.8 } 20.4 20.8 —0.4 12.¢ 13.3 -0.4 -—0.5 20.2 20.3 —0.1 
20-RL2 3 3 7 -0.4 16.7 0.3] 15.1 15.5 -0.4] 21.6 22.1 -0.5 | 14.8 14.6 -0.3 0.1 22.5 21.6 0.9 
20-RL3 4 7 8 -1.1 17.8 -0.5!] 15.3) 16.6 -1.3 |] 21.9 23.2 -1.3 | 14. 15.7 —0.8 —O0.8 | 23.2 22.7 0.5 
Overall-Peaked Reference 
-DL oa 4 A 1.0 12.1 1.3] 12.2 10.7 1.5) 18.8 17.2 156°.) dha 2.3] 25.3 24.4 0.9 | 18.3) 16.5 1.8 
8 a) 0.5 12.6 0.8 11.2 1.0 17.7 wet 1.8 24.9 O.4 17.0 1. 
20-8000 LPF 9 we 0 12 12.7 1.6) 13.4 11.3 2.1) 19.9 17.8 2.1 11. 2.0 | 25.9 25.0 0.9 | 18.9 17.1 1.8 
20-2400 LPF 9.6 A 7 0.4 15.4 0.3) 15.1 14.0 1.1 /-21.7 20.5 1.2 | 13. 1.1] 28.4 27.7 0.7 | 21.6 19.8 1.8 
20-1900 LPF 8 oll 9 -0.8 19.6 —2.0] 18.1 18.2 —0.1 | 24.6 24.7 -0.1 16. 0.2 | 32.5 31.9 0.6 | 25.6 24.0 1.6 
a 0 0.1 8.7 -1.1 17.3 0.8 23.8 0.8 1.1 31.0 1.5 23.1 225 
20-RL1 “rh 0 an 0.2 14.5 1.0] 18.9 13.1 0.8 | 20.4 19.6 QS* | 124 1.5} 26.8 26.8 0 4 1.3 
20-RL2 4 3 7.56 -0.2 16,2 0.8 | 15. 14.8 0.3 | 21.6 21.3 0.3 14. 1:2.) 28.7 28.4 0.2 6 1.9 
20-RL3 3 ay 4. -0.7 1754 0.2) 15.3 15.7 —0.4] 21.9 22.2 -0.3 14.4 0.9 | 28 29.4 -0.5 a) 1.7 
Sidetone-DL Reference 
Ref. DL ay 7.4 A 2. -5.5 —-1.8|—-8.0 -7.1 -0.9|-2.2 -0.5 -1.7 |-9. 320 -4.2 ) 4.1 58 -1.7 [-2.9 -1.1  -1.8 
—10 DL 5 6 4A -0.8 5.7 0.6: Bat 4.3 -0.6] 10.2 10.8 -0.6 2 3.8 -1.3]16.6 V7.1 -0.5 7 We 0.5 
—20 DL 2 9 gic: =O 13.3 -1.1 10.8 11.8 -1.0/ 17.8 18.5 —-0.8 9 9 -1.0) 24.9 24.8  -0.1 V7.8 18.0 -0.2 
R 900 4 a 6 0. -1.1 1.1) -1.4 -2.4 1.0) 4.3 4.2 0.1 28 -1.6] 13.3 10.7 0.6 ful BR 1.3 
R 1800 A 6 6 —-1. -0.9 -0.1 | —2.4 -2.8 —-0.1 4.2 4.3 —-0.1 2 0.3) 10.5 10.6 —-0.1 4.0 3.9 0.1 
R 1800 5.2 6 5 2. 1.0 1.4 3.7 2.4 1.3 2.8 4.3 1.5 3 1.2 9.6 10.5 -0.9 2.8 3.9 -1.1 
R 3200-A 2,2 9 .O 1. 6.38 -1.5 3.3 4.9 -1.6] 10.0 11.5 -1.5 2 -2.5 ) 17.40 18.1 —0.7 9.6 11.2 —-1.6 
R 3200-B 2 8 2, 6.3 —2.1 2.8 4.9 —2.1 9.3 11.4 —2.2 2 -1.5] 16.4 17.8 -1.4 10.20 11.0 —0.8 
RL4 20 a3) a4 0. 0.8 0.8 3.1 -—2.5 -0.8 3.5 4.3 0.8 2 0.8 YS 9.8 0 2.8 3.0 —0.2 
Sidetone Peaked 
Reference 
Ref. DL 7 7.4 6.8 0.6 7.3 a2 0.1 8.0 9.6 1.6 |-2.2 —2.2 0 —9. im 1.8 4.1 4.5 -0.2 [-2.9 —3.8 0.9 
—10 DL 6 5.6 6.1 —0.5 5.1 7 40.4 3.7 3.5 0.4/10.2 9.8 0.4 2 6 0.9) 16.1 17.0 -—0.9 9.7 9.2 0.5 
—20 DL 3.4 0) 12.4 1.5 12.2 0 1.2 10.8 9.7 1.1) 17.3 16.3 1.0 9.¢ mt) 2.0 | 24.9 23.3 1.6 WS 15.4 2.4 
R 900 a) 2 0.3 —0.1 “il 1.) [ —1.4 —2.5 1.1 4.3 4.1 0.2 |-1. 2 2.5) 11.3 11.5 0 5.1 3.4 17 
R 1300 5.6 6 —-1.5 -0.1 .O 0 2.0 | -2.4 -—4.4 2.0 4.2 2,1 2.1 —2 0 3.1) 10.5 9.6 0.9 4.0 1.7 2.3 
R 1800 2 6 0.3 2.3 4 28 —0.1 | -38.7 -3.7 0 2.8 2.9 —0.1 |-4.? A 0.9 9.6 10.5 -—0.9 2.8 2.2 0.6 
R 3200-A al 5.9 6.7 -0.8 8 5.8 -0.5 3.3 3.8 —0.5 10.0 10.4 —0.4 1. 2 -0.5 [17.4 17.8 —0.4 9.6 9.9 -0.3 
R 3200-B 2.1 4.8 5.5 —0.7 2 4.1 0.1 2.8 2.7 0.1 9.3 9.3 0 2 1 1.8] 16.4 16.7 -0.5 10.2. 8.8 1.4 
RL4 §.2 9 -1.3 0.4 6 —2.7 1.1] -3.1 —-4.0 O09} 3.5 2.5 1.0 |-4. es) 1.5) 9.8 98 0 2.8 18 1.0 
Error: Average —0.4 —0.1 0.2 0 0.3 O.1 0.8 
o 0.9 1.1 1.0 1.0 1. 0.9 | 
Difference —0.8,0.6 —1.4,0.5 6.5 —7.7,0.6 —7.1,0.7 
Distributions <——__——_. <——_-> < — 
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TasLe XIX—Computrep Loupnsss LossEs (DB) ror FRASER Tests (1946) (See also Table IX) 
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1 2 3 4 | 5 | 6 | 7 8 | 9 
a b c a b c a b ce a b c a b _ ¢ a b ¢ a b c 
GRAPHICAL COMPUTATION (Fig. 22) 
Orthotelephonic Responses Artificial Voice/6-cm? Coupler Responses 
Same as Computation Form Except . 
Analysis Band = Analysis Band = Analysis Band = Analysis Band = 
Observed Computation Form Linear Loudness Linear Loudness Law, Reference Band = Reference Band = Reference Band = Reference Band = 
Ref. Ckt. (Fig. 14) Law No Spread-of-Loudness 100-5000 Hz 300-3300 Hz 100-5000 Hz 300-3300 Hz 
Setting 
Test System (dB) Test Ref. A Test Ref. A Test Ref. A Test Ref. A Test Ref. A Test Ref. A Test Ref. A 
A 33.3 21.9 20.2 1.7 21.6 20.1 1.5 20.3 18.4 1.9} 26.9 25.0 1.9 | 21.0 °22.3 ~1.3 | 33.1 33.9 0.8 | 28.6 33.7 -5.1 
B 30.9 26.1 26.6 —0.5 26.2 26.7 —0.5 25.0 25.0 0 31.5 31.6 -—0O.1 | 25.9 28.9 —3.0 | 37.1 40.5 ~—3.4 | 32.9 40.3 —7.4 
Cc 41.5 28.6 28.2 0.4 28.1 28.3 -—0.2 26.9 26.6 0.3 | 33.5 33.2 0.3 27.6 30.5 —2.9 39.6 42.1 —2.5 35.3 41.9 —6.6 
D 43.0 30.6 30.5 0.1 29.7 29.8 -0.1 28.4 28.1 0.3 | 35.0 34.7 0.3 | 29.1 32.0 -—2.9 | 41.8 43.6 -1.8 | 36.7 43.4 -6.7 
Eb 40.3 27.8 27.5 0.3 27.3 27 1, 0.2 26.1 25.4 0.7 | 32.7 32.0 0.7 | 26.8 29.3 -2.5 | 38.6 40.9 -2.3 | 34.3 40.7 -6.4 
F 43.8 29.7 31.0 -1.3 29.7 30.6 -0.9 28.5 28.9 —0.4} 34.9 35.5 -0.6 | 28.2 32.8 -4.6 | 40.8 44.4 -3.6 | 35.7 44.2 -8.5 
G 40.6 28.5 27.8 0.7 29.2 27.4 1.8 28.0 25.7 2.3 | 34.6 32.3 2.3 | 28.8 29.6 -0.8 | 40.8 41.2 -0.4 | 36.5 41.0 4.5 
Error: Average 0.2 0.3 0.7 0.7 —2.6 —2.1 —6.5 
o 0.9 0.9 0.9 1.0 1.2 1.1 1.2 
Test System —0.2,0.5 -1.2 6.6 —5.8,0.3 —4.5,0.4 
<—_——_—__—_-_ vst eee 
Difference 6.1,0.3 
Distributions <> 
7.5, 0.2 
Reference <<. 
System —0.2, 0.3 -1.7 6.6 -2.7 —0.2 
CC OO ee Pa eee SS” <————~ 
Difference 8.9 
Distributions > 
11.4 
<< 
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on an orthotelephonic basis, is sharply bandlimited to 250-4000 Hz, 
and has 0 dB of loss within this band. For example, test network 
125 Hz HPF (sensation level = 100 dB) of Table V has an L, = 
107.1 — 0.2 = 106.9 dBt. The loudness loss is then 89.4 — 106.9 = 
—17.5 dB. 

Entries in columns 4 through 9 were computed using the graphical 
form of Fig. 22. Column 4 entries are essentially a repeat of the 
column 3 entries, the difference being, as noted earlier, that the 
former were computed using a linear loudness law while the latter 
were computed using the loudness law of Figs. 6 and 8. Column 5 
entries were computed without applying spread-of-loudness correc- 
tions. 

The reference spectrum for computing the values given in columns 
3, 4, and 5 was bandlimited to 250-3000 Hz. For example, entries 
of column 5 were obtained by plotting each of the responses on the 
form of Fig. 22, then measuring the area (square inches) enclosed 
by this response curve, the base line (designated 0 inches), and the 
left-hand and right-hand boundaries (designated 100 Hz and 5000 
Hz respectively). This area was then divided by the base (inches) 
corresponding to the 250-4000-Hz bandwidth, and the equivalent 
height (inches) converted to loudness rating (dB) using the left-hand 
and right-hand ordinate scales. Entries of column 6 were obtained in 
the same way as those of column 5 except that the area was divided 
hy the base (inches) corresponding to a 100-5000-Hz bandwidth. 
Thus, the change between columns 5 and 6 was simply one of refer- 
ence spectrum bandwidth. 

Entries of column 7 were obtained in the same manner as those of 
column 6 except that the area measured was restricted to the 300- 
3300-Hz bandwidth characteristic of EARS, and the reference spectrum 
was limited to the 300-3300-Hz band. Entries of columns 8 and 9 
were obtained in the same manner as were entries of columns 6 and 7 
respectively except that artificial voice/6-cm* coupler responses were 
used for the former, orthotelephonic responses for the latter. 

The entries of columns 9a and 9b were computed in a manner which 
reflects the essential features of the EARS and, therefore, these en- 
tries closely approximate (within the bounds of computational and 
measurement error) what would be measured with the EARS on those 
test and reference systems utilizing linear transducers. However, 
column 9a and 9b entries of Tables XVII and XVIII and column 9a 
entries of Table XIX do not necessarily represent what would be 
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measured with the EARS. The reason for this is that the systems for 
these cases utilized carbon transmitters which are nonlinear devices. 
Transmitter responses used in the computations, based on measure- 
ments made with real speech, would probably differ from the re- - 
sponses pertaining during an EARS measurement because of the 
highly different nature of speech and the EARS acoustic test signal. 
This matter is now under study. 

The error distributions corresponding to each of the seven compu- 
tation methods exemplified by the column 3 through 9 entries are 
summarized in Table XX. These distributions reflect the accuracy 
of the computed ratings in predicting reference channel setting for 
equal speech loudness. The entries of column 1 are repeated from 
Table IV. These show that the computational method provides an 
accurate means of predicting subjective loudness balances for all of 
the tests except the Steinberg linear system tests at a sensation level 
= 39 dB. As noted earlier, received speech levels in a well-engineered 
communications system will seldom be this low, and if they do occur, 
are likely to represent trouble conditions. 

Comparing the entries of column 2 to those of column 1, we note 
that there is little change in the error distributions, indicating that 
the graphical method is a close approximation of the computational 
method. Exceptions to this are the results for the Steinberg tests, par- 
ticularly at the lower sensation level where the average error and 
error standard deviation are somewhat greater than when using the 
computation form. The reason for this is the change from the loud- 
ness law of Figs. 6 and 8 to a linear loudness law. 

Column 3 entries indicate a further reduction in accuracy due to 
neglecting spread-of-loudness. The greatest change occurs for the 
Steinberg tests which involved numerous filter conditions and the 
Van Wynen telephone set tests which involved transducers with pro- 
nounced resonances. Both of these represent instances where spread- 
of-loudness effects would be important. Somewhat less change occurs 
for the Van Wynen linear system tests which also involved filter 
conditions. 

The remaining columns show errors which might be expected using 
the EARS as presently defined (column 7), and the EARS modified to 
incorporate a wider measuring band (column 6). For the most part, 
extending the band improves the accuracy appreciably with the ex- 
ception of the Steinberg tests and the Van Wynen telephone set tests. 
As regards the former, examination of Table XV indicates that extend- 


TABLE X X—ERrRor DISTRIBUTIONS 


























(1) (2) (3) (4) (5) (6) (7) 
GRAPHICAL COMPUTATION (Fig. 22) 
Orthotelephonic Responses Artificial Voice/6-cm? Coupler Responses 
Same as Computation Form Except 
Computation Linear Linear Loudness Analysis Band = Analysis Band = Analysis Band = Analysis Band = 
Form Loudness Law, No Spread-of- | Reference Band = | Reference Band = | Reference Band = | Reference Band = 
Test Series Designation (Fig. 14) Law Loudness 100-5000 Hz 300-3300 Hz 100-5000 Hz 300-3300 Hz 
Avg. e Avg. o Avg. o Avg. o Avg. o Avg. o Avg. o 
Steinberg Linear System 
Tests-Sensation Level = 100 dB 1.4 1.1 2.4 1.0 3.6 1.9 3.7 1.9 2.5 3.6 3.8 127. 2.4 4.3 
Sensation Level = 39 dB 4.0 22 &.7 3.9 7.0 4.6 7.8 4.6 6.0 3.3 7.1 3.9 5.7 4.6 
Van Wynen Linear System 
Tests —0.8 251 —0.8 2.0 —0.6 2.6 —0.6 2.4 —3.5 3.1 0.1 2.9 —3.3 3.6 
Van Wynen Telephone Set 
Tests 0.3 1.1 0.1 0.8 27 0.5 2 0.7 1.6 0.7 5.2 1:2 2.5 1.0 
Van Wynen Tie Line 
Tests —0.4 0.9 -0.1 1.1 0.2 1.0 0 1.0 0.3 1.4 0.1 0.9 0.8 1,2 
Fraser Tests 0.2 0.9 0.3 0.9 0.7 0.9 0.7 1.0 —2.6 1.2 —2.1 1.1 —6.5 1.2 
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ing the band substantially improves the accuracy for the low-pass 
filter conditions but decreases the accuracy for the high-pass filter 
conditions. 

The entries of columns 4 and 5, in a sense counterparts of columns 
6 and 7 respectively, indicate what might happen if an EARS were 
built to utilize an artificial voice and an artificial ear which were 
accurate simulations of the human voice and ear within the context 
of the orthotelephonic definitions (see Fig. 3), and the system were 
calibrated in conformance with these definitions. Comparing columns 
4 and 6, and 5 and 7, we see that the accuracy improves for the 
Van. Wynen telephone set tests and the Fraser tests, remaining about 
the same for the other tests. 

Let us now direct our attention to the difference distributions of 
Tables XV through XIX. These are given at the bottom of the tables 
together with arrows showing the computational methods compared. 
To obtain the difference distributions, differences were obtained be- 
tween numerical entries, condition by condition. The first number 
associated with an arrow represents the average difference, a positive 
number signifying that ratings entered in columns at the right-hand 
tip of the arrow are numerically larger than those at the left-hand tip 
of the arrow. The second number, if given, represents the standard 
deviation of the difference distribution. In many cases, a standard 
deviation is not given because it was found to be of the order of 0.1 
dB, insignificant for present purposes. 

The difference distributions referred to above are summarized in 
Table XXI. Note that there is a set of entries for each different 
transmitter-receiver combination excepting the Steinberg tests for 
which entries are given for each of the two sensation levels. 

Column 1 of Table X XI indicates that the graphical method results 
and computation method results differ very little except, as was noted 
in discussion of Table XX, in the case of the Steinberg tests with the 
sensation level = 39 dB. Differences in graphical ratings with and 
without spread-of-loudness included are also seen to be relatively 
small from entries of column 2. Entries of column 3 show that the 
change in ratings due to changing the reference spectrum bandwidth 
from 250-4000 Hz to 100-5000 Hz is essentially a constant, although 
some slight variation from test to test is apparent. 

The remaining columns, 4 through 7, show that (7) measuring system 
bandwidth differences and (77) orthotelephonic versus artificial voice/ 
6-cm* coupler differences are highly dependent on transmitter-receiver 


TABLE X XI—DIFFERENCE DISTRIBUTIONS 





Test Series Designation 


Steinberg Linear System Tests 
Sensation Level = 100 dB 
Sensation Level = 39 dB 

Van Wynen Linear System Tests 

Van Wynen Telephone Set Tests 
Test Telephone Set A 
Test Telephone Set B 
Reference Telephone Set 

Van Wynen Tie Line Tests 

Fraser Tests 
Test System 
Reference System 








(1) (3) (4) (5) (6) (7) 
Artificial Voice 
Graphical Reference 300-3300 Hz 6-cm* Coupler Responses 
Loudness Spectrum Minus 100-5000 Hz Minus Orthotelephonic Responses 
Rating 100-5000 Hz ce $$$ 
Minus Minus Artificial 
Computation Reference Ortho-, Voice/6-cem3 
Form Loudness Spectrum telephonic Coupler 
Rating 250-4000 Hz Responses Responses 100-5000 Hz 300-3300 Hz 
Avg. o Avg. o Avg. o Avg. o Avg. o Avg. | oo 
0.2 0.5 0.5 6.6 —_— —3.3 1.8 —-1.0 3.1 8.7 1.0 11.0 2. 
—2.5 2.5 -0.3 6.6 —_ —3.4 2.2 —1.1 3.3 8.7 1.0 11.0 2. 
0 0.5 -1.1 6.6 _ —4.4 3.0 —3.0 3.9 8.1 0.9 9.4 0. 
-1.0 0.5 —0.2 6.4 — —8.0 _ —7.5 _— 7.7 — 8.2 — 
—-0.6 0.5 1.0 7.0 — 7.0 0.5 —6.6 0.5 8.8 — 9.1 _ 
—0.5 0.3 —2.0 6.6 = —6.1 0.7 —4.3 1.5 6.1 — 7.9 0.6 
—-0.8 0.6 —1.4 6.5 = 7.7 0.6 care 0.7 6.6 0.5 7.4 0.5 
—0.2 0.5 -1.2 6.6 eta —5.8 0.3 —4,5 0.4 6.1 0.3 7.5 0.2 
—0.2 0.3 -1.7 6.6 = —2.7 — —0.2 _— 8.9 — 11.4 _— 
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combinations. These differences appear to fall into two categories: 
(zt) the Steinberg and Van Wynen linear system tests and the reference 
system of the Fraser tests for all of which linear, but different type, 
transducers were used; (27) the Van Wynen telephone set and tie line 
tests and the test system of the Fraser tests for all of which nonlinear 
transducers were used. The transmitter and receiver types were identical 
(although the specific transducers were different) for the reference 
telephone set of the Van Wynen telephone set tests, the Van Wynen 
tie line tests, and the test system from the Fraser tests. 

That the difference distribution standard deviations are relatively 
small and there seems to be a strong dependence of average difference 
on transmitter-receiver combination, suggests the possibility of apply- 
ing correction factors to ratings computed by one method, e.g., the 
EARS method, to obtain ratings for some other method, e.g., the 
computational method. The average differences summarized in Table 
XXI represent these correction factors. 

Returning to Table XX, we note rather large errors in some cases 
with the EARS method (see column 7) while the errors for the com- 
putation method are rather small (see column 1). Obviously, applica- 
tion of correction factors to the Steinberg, Van Wynen linear system, 
and Van Wynen tie line tests will not improve the EARS method 
error distributions since in each of the tests, the particular trans- 
mitter-receiver combination was common to: the test and reference 
channels. . 

Different transducers were used in the test and reference channels 
of the Van Wynen telephone set tests and the Fraser tests. Let us 
examine the effect of application of correction factors to column 9 
entries of Tables XVII and XIX on error distributions. The correc- 
tion factors can be obtained from Table XXI. When the appropriate 
entries of column 9 of Tables XVII and XIX are so converted, the 
error distribution for the Van Wynen telephone set tests 1s changed 
from one with average = 2.5 dB, o = 1.0 dB to average = 0.3 dB, 
o = 1.5 dB comparing favorably to the column 3 values (Table 
XVII). Similar values for the Fraser tests are average = —6.5 dB, 
o = 1.2 dB before correction, average = 0 dB, o = 1.3 dB after 
correction, comparing favorably to the column 3 values from Table 
XIX for which average = 0.2 dB, co = 0.9 dB. 

Review of preceding discussion and the information contained in 
Tables XV through XXI indicates that the EARS provides a simple 
and reasonably accurate method of measuring telephone connection 
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loudness loss. For a telephone plant utilizing telephone sets of a single 
design, or of several similar designs, loudness ratings determined 
following the EARS procedure can provide an effective tool in telephone 
transmission engineering. In situations where the plant utilizes some- 
what different telephone set designs, it appears that reasonably good 
design can result from using the EARS method to determine perform- 
ance with the individual set designs. Comparisons between designs 
may then be made by application of correction factors. Determination 
of these correction factors can be a time consuming and difficult task 
as is evident from consideration of the orthotelephonic and artificial 
voice/6-cm* coupler response definitions of Figs. 3 and 16 respectively, 
but it is worth noting that such correction factors need be determined 
only once for each transmitter-receiver combination mounted in a 
particular handset configuration. 
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The Transmission Performance of 


Bell System Toll Connecting Trunks 


By J. E. KESSLER 
(Manuscript received April 19, 1971) 


A systemwide survey of the performance of Bell System toll connecting 
trunks was undertaken in 1966. Various results of this survey have been 
used in other studies since that time. This paper collects the main results 
in a single writing. The sampling plan and measurement procedures 
are discussed briefly. Measurement results are presented in distributional 
form and as estimates for the transmission parameters relating to loss, 
message circuit noise, impulse noise, relative envelope delay, P/AR, 
harmonic distortion, and physical characteristics. Some results are pre- 
sented separately for major facilities and facility categories. 


I. INTRODUCTION 


The Bell System telephone network consists of a hierarchy of trans- 
mission facilities available for connecting one subscriber to another. 
These include the loops from the subscribers to their local telephone 
office, toll connecting trunks from the local to the toll office, and inter- 
toll trunks between these toll offices. Clearly, the transmission per- 
formance of a toll connection is affected by each part of this built-up 
connection. Overall measures of performance have been reported in 
previous connection surveys’’” and there has been a recently completed 
1969-70 Connection Survey.*~° There is a concurrent need for detailed 
knowledge of the specific parts of toll connections. This information 
finds important applications in the setting of trunk objectives for 
the various transmission parameters, and in the effort to simulate, on 
a computer, a model of the Bell System transmission network. To this 
end, this paper reports various population estimates of the transmission 
performance of toll connecting trunks, based on a systemwide survey 
undertaken in 1966. Individual portions of this information have been 
used separately, and this paper serves to bring all the results together 
in a single document. To the author’s knowledge, these data represent 


2741 


2742 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1971 


the only published estimates of toll connecting trunk performance. 
Although the mixture of facilities has changed with time, estimates 
of parameters for individual types of facility are expected to be rea- 
sonably stable. 

A toll connecting trunk is defined as any trunk connecting an end 
office (class 5) with a toll office (class 4, 3, 2, or 1). For purposes of 
this paper each toll connecting trunk consists of the trunk facility 
and any office equipment associated with the trunk. The office equip- 
ment may be attenuation pads, hybrid transformers, repeating coils, 
etc. Figure 1 shows the various types of toll connecting trunks used 
in the Bell System. 

In general, a toll connecting trunk consists of switching equipment 
at both ends and transmission facilities in between. Consistent with 
their application to trunk transmission objectives studies, trunks are 
measured on a switch-through-switch basis (see Fig. 2). This enables 
one to use these data in constructing estimates for overall connections. 
In toll offices measurements are made from a toll testboard, master 
test frame, or an outgoing trunk (OGT) board. In the end offices the 
testing is done from a master test frame or OGT board. Thus, the trunk 
includes (besides the actual facility of two-wire or four-wire voice- 
frequency cable, or carrier channel) the trunk circuit, repeating coils, 
four-wire terminating sets, and switching equipment, etc. 

The toll connecting trunks in the survey were measured for the 
following transmission parameters: 


(t) 1000-Hz loss 
(22) Frequency response 
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Fig. 1—Types of toll connecting trunks. 
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Fig. 2—Composition and testing arrangement. 


(17t) Relative envelope delay 
(iv) Message circuit noise 
(v) Impulse noise 
(vt) P/AR meter readings 
(vit) Harmonic distortion 
(viz) Level tracking on compandored facilities. 


All measurements were made in both directions of transmission with 
the exception of impulse noise. It was measured only incoming to the 
class 5 office. The frequency response and relative envelope delay were 
measured at 17 frequencies from 200 to 3400 Hz. Bell Laboratories 
personnel carried out these measurements, with one person at each 
end of the toll connecting trunk. 

In addition to actual measurements, information about trunk length, 
type of switching, and type of facility used for the toll connecting 
trunk was obtained from office records, such as circuit layout cards. 

There are many different facilities in use as toll connecting trunks 
in the Bell System. This survey encountered several separably iden- 
tifiable types of facilities and, where there was a significant sample, 
results are given for these facility subclasses. 


II. SAMPLING PLAN 


The sampled population consisted of all the Bell System toll con- 
necting trunks in the continental United States and Canada. Although 
the exact number was not known, it was estimated that there were 
about 800,000 toll connecting trunks in service in 1966. 

The sample of toll connecting trunks of this survey consisted of 
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150 trunks from 15 end offices, each having less than 400,000 annual 
outgoing toll messages (AOTM), and 242 trunks from 25 end offices 
each having more than 400,000 AOTM. These 392 trunks were measured 
for most parameters in both directions, thus yielding 784 population 
elements in the sample. 

The sample design consisted of a two-stage plan with first-stage 
stratification and with the primary units selected with probabilities 
proportional to a measure of size. (See Refs. 6, 7, or 8 for a general 
discussion of sampling plans of this type.) The primary units were 
Bell System end office (class 5) buildings. These end offices were strat- 
ified into two disjoint subpopulations, according to the number of 
annual outgoing toll messages originating in the end office. This was 
done because a pilot survey in 1965 indicated that large end office 
buildings usually have shorter toll connecting trunks than small end 
office buildings. The large end offices also tend to use voice-frequency 
cable (VF) toll connecting trunks rather than carrier. By stratifying 
the population into two groups that tend to internal homogeneity 
one can more efficiently procure the necessary estimates with the 
desired precision. The confirmation of this rationale will be borne 
out in a later section discussing the physical characteristics of toll 
connecting trunks. . 

The frame for the first-stage sample was a list of the 9052 Bell System 
end office buildings in service on January 1, 1964. These end office 
buildings were divided into two subpopulations, according to whether 
they originated 400,000 AOTM or not. From stratum 1 of end office 
buildings with less than 400,000 AOTM, 15 end office buildings were 
selected. From stratum 2, twenty-five end office buildings were selected. 
Each end office building was assigned a probability of selection propor- 
tional to its AOTM. This measure of size had been used for the first- 
stage selection of offices in the concurrently conducted toll connection 
survey, and practicality deemed its use again for the toll connecting 
trunk survey. There is a strong correlation between the AOTM and 
the number of toll connecting trunks in an end office building. The 
sampling of end office buildings in each primary stratum was performed 
independently. 

For the second stage of sampling a listing of all toll connecting trunks 
in each primary unit (i.e., the selected end office building) was obtained. 
For controlling the complexity of the survey, a three-day visit was 
planned for each office. This practical constraint indicated the ability 
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to measure 8 to 12 trunks in each office. Therefore, where possible, 12 
toll connecting trunks were chosen by simple random sampling, inde- 
pendently in each primary unit. The toll connecting trunks thus selected 
become the second-stage units of the sample design. 

It is possible for an end office to have trunks to more than one toll 
office building, requiring the tester at the toll office end of the trunks 
to travel to more than one building. The goal of testing 8 to 12 trunks 
from each end office was met except in three offices. In one office, only 
seven toll connecting trunks existed, and in the other two offices practical 
difficulties arose to limit the measurements to seven and five trunks. 

The data analysis consisted of using estimation formulas appropriate 
for the sample design described previously. These formulas yield esti- 
mates of population parameters (i.e., mean, variance, standard devia- 
tion, and 90-percent confidence intervals) based on the sample data. 
The estimates are weighted to account for the structure of the sampling 
plan, with its stratification and selection proportional to a measure of 
size. Probability sampling of this type has the distinct advantage of 
providing estimates of the important characteristics of the population 
from which we sample, and at the same time providing a quantitative 
measure of the sampling error. 


IiI. PHYSICAL CHARACTERISTICS 


For each of the toll connecting trunks of the sample a record was 
obtained of some of its physical characteristics. This included the route 
length of the trunk, the type of switching at each end of the trunk, 
and the type of facility comprising the trunk. From these data, estimates 
were made of trunk length distributions, facility composition of toll 
connecting trunks, and percentages of different switching machines 
used. 


3.1 Trunk Length 


The trunk length distribution is shown in Fig. 3. A substantial amount 
(21 percent) of the trunks are intrabuilding trunks resulting from the 
end office and toll office being in the same building. In these cases 
the toll connecting trunk usually consists of a 2-dB pad and switching 
equipment in tandem with a short length of cable. 

Basically, toll connecting trunks span short distances. About half 
the trunks are less than 10 miles long, and only 10 percent of the trunks 
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Fig. 3—Trunk length distribution. 


are longer than 30 miles. However, there were two trunks in the sample 
that were 151 miles long. This resulted from indirect geographic routing 
which added about 100 miles to their actual length. 

As previously mentioned, the survey sample had a primary stratifica- 
tion based on the belief that large and small offices were significantly 
different with regard to the types and length of the toll connecting 
trunks associated with them. These differences are corroborated in 
Table I which shows the estimated length and type of facility in large 
and small offices. The metropolitan offices (i.e., large, over 400,000 
AOTMs) tend to have toll connecting trunks much shorter than those 
in small offices, and they utilize voice-frequency cable to a much 
greater degree than the small offices. 

A finer relationship between trunk length and type of facility is 
shown in Table II, in terms of three specific length intervals, 0-15, 
15-30, and over 30 miles. The estimated percentage of each facility 
type used within each interval is given. 


3.2 Facility Composition 


The sample of 392 trunks contained many different facilities. Those 
types appearing sufficiently often to be estimated reliably are shown 
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in Table III. The percentages of both categories of facilities and single 
types of facilities are estimated in Table ITI. 


3.3 Switching Machines 


The toll connecting trunk encounters some type of switching at 
both the end office (class 5) and the toll office (class 4 or higher). In 
this survey we noted the type of switching machine used at each end 
of each toll connecting trunk in the sample. The numbers of the type 
of switching machine used are classified by end office, toll office, and 
stratum, and are given in Table IV. There were 40 end office buildings 
and 66 toll office buildings in the survey, with some office buildings 
having more than one type of switching system. 

It should be noted that these numbers of switching machines are 
only the types of switching that were found with the toll connecting 
trunks of this survey. They should not be taken to accurately represent 
the distribution of switching machines in the Bell System, since the 
basic population element that was sampled in this survey was a toll 
connecting trunk and not a switching machine. Also, even though the 
end office buildings of the primary strata selections may be related to 
switching machines, the same can not be said of the toll offices. In 
many cases, all the trunks from an end office home on the same switch- 
ing machine in the toll office, thus biasing any estimates at the toll 
office. 

At the toll offices there is a sizable amount of switching designated 
as ‘switchboard.’ This category is composed of those trunks for which 
the transmission measurements were made at a switchboard. Therefore, 
this category includes toll switching, recording-completing, and operator 
office trunks (see Fig. 2). This category does not constitute manual 
switching, where all of a customer’s calls to the toll office have to be 
handled by an operator at a switchboard. In all the toll offices associated 
with this classification there also was a switching machine. This type 
of switching machine could have been used as an alternative designa- 
tion to the switchboard classification. (In fact, operator office trunks 
may have a multipled switching machine appearance, as well as the 
switchboard appearance.) However, since the information recorded 
was to reflect the type of switching actually used by the sampled toll 
connecting trunks of the survey, the switchboard designation was 
chosen. 

In the class 5 end offices encountered in this survey about 54 percent 
of the trunks utilized step-by-step switching systems, and 34 percent 
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used No. 5 crossbar. These two types account for the bulk of the switch- 
ing machines in use at the class 5 end of the samovled toll connecting 
trunks. At the toll office, crossbar tandem systems were used most 
frequently, followed by step-by-step systems and switchboards (with 
the conditions mentioned above defining switchboards). 


IV. TRANSMISSION LOSS 


4.1 Frequency Response 


The loss of toll connecting trunks was measured at 17 frequencies, 
from 200 to 3400 Hz. The measurements were made switch-through- 
switch as shown in Fig. 1. At each frequency a tone at 0 dBm was 
transmitted over the trunk and measured at the receiving end with a 
25A Gain and Delay Measuring Set. This procedure was repeated for 
each direction of transmission over the trunk. Since the design of the 
measurement equipment precluded measuring losses greater than 31 dB, 
it was not always possible to measure some trunks at the higher fre- 
quencies. This was the situation in particular on carrier facilities, 
such as N1, ON, and Lenkurt. For losses greater than 31 dB, the value 
of 31 dB was used. This provides a conservative estimate of the mean 
loss, but is more realistic than simply eliminating these readings. Also, 
this effect occurred only at 3300 and 3400 Hz, and only on some carrier 
facilities. 

The frequency response statistics for all the toll connecting trunks 
in the survey are shown in Table V. The results show the mean and its 
associated 90-percent confidence interval, and an estimate of the 
standard deviation for each distribution that is considered. Table V 
also shows the loss estimates for the two types of facilities, carrier 
and voice-frequency cable, used by toll connecting trunks. It appears 
that the loss responses for both types of facilities are similar throughout 
most of the voiceband, with the carrier facilities having a sharper 
cutoff at the high and low frequencies. 


4.2 Loss Relative to 1000 Hz 


For the individual types of facilities it is useful to present the loss 
relative to 1000 Hz. This enables one to see the loss characteristics 
of the facility itself more readily than examination of the absolute 
value of loss does, since different loss design procedures are encountered 
among the many trunks. Table VI shows the loss relative to 1000 Hz 
for the more commonly used carrier facilities. Table VII shows the 
loss relative to 1000 Hz for the major VF facilities. 


TOLL CONNECTING TRUNKS 2749 


Vv. NOISE 


The term noise is applied to a variety of electrical phenomena, all 
of which are unwanted additive signals and tend to interfere with the 
transmission of information. There are numerous sources of noise, some 
man-made and some not. Man-made noise is generated by power 
supply hum, faulty contacts, electrical apparatus, quantizing errors, 
etc. The noise from non-man-made sources can originate within or 
without the systems under measurement. All systems are prone to 
thermal noise from the random motion of electrons in conductors, 
as well as to atmospheric electrical disturbances. 

The effects of these forms of noise depend, in part, on the type of 
signal being transmitted over the telephone communication channel. 
For analog speech signals the average power of the noise is of primary 
concern. Interference from many noise sources contribute to this type 
of noise, referred to as ‘‘message circuit noise.”” However, data signals 
suffer more severely from noise voltage peaks (whether of short or 
long time duration), which are classified as “impulse noise.’”’ A different 
procedure is used to measure these two noise parameters and each is 
reported separately in the following sections. 


5.1 Message Circuit Noise 


The level of background noise was measured using the 3A Noise 
Measuring Set (hence, the name 3A noise) with both C-message and 
3-kHz flat weighting networks.” The noise measurements were performed 
for both directions of transmission on the trunk. The measurements 
were made during the normal business day. 

A summary of the noise measured with C-message weighting and 
3-kHz flat weighting is shown in Table VIII in terms of the mean and 
its 90-percent confidence interval, and standard deviation. Estimates 
for specific facilities and groups of facilities are included where there 
were sufficient data for useful statistics. 

It can be observed that in all categories the 3-kHz flat noise level 
is higher than with C-message weighting, since the 3-kHz flat weighting 
does not attenuate the low frequencies nearly as much as C-message 
weighting. The flat noise on voice-frequency facilities was slightly 
higher than on carrier, probably because carrier systems have a higher 
cutoff frequency at the low end of the frequency band, below which 
nothing is transmitted. 

The estimates of flat noise were significantly affected by the direc- 
tion of transmission being observed. A subclass analysis based on this 
characteristic showed that average flat noise measured incoming to 
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the end office (class 5) was considerably higher than that incoming 
to the toll office end of the trunk. Estimated statistics for the parameter, 
5, defined as the end office flat noise minus the toll office flat noise, 
are given in Table IX. The difference in noise levels is more pronounced 
on voice-frequency cable facilities than on carrier facilities because 
there is less low-frequency energy transmitted on carrier facilities, 
in either direction, due to the action of channel filters. 

Since the end office noise level is higher it suggests that its noise 
environment is worse than that of the toll offices. This could be due 
to extraneous low-frequency components (primarily 60 Hz and its 
harmonics) generated by office battery and other power plants. Reason- 
ing that larger offices have more noise-producing loads one would expect 
the larger end offices (strata 2) to have higher average noise levels 
than the small end offices (strata 1). This is borne out in Fig. 4, where 
a difference of about 6 dB is observed. There is further support for 
these hypotheses in comparing the toll office flat noise of toll connecting 
trunks with the toll office flat noise of short intertoll trunks’? (0-62.5 
miles). The result, with 90-percent confidence limits, is shown in 
Table X. 

The data for 3A noise with C-message weighting were also analyzed 
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Fig. 4—3A noise with 3-kHz flat weighting. 
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with respect to trunk length. For this purpose six mileage bands were 
chosen, with each succeeding band being a doubling of the previous 
interval. The “double distance” criterion is chosen since previous 
studies'’”’’° have shown a linear relationship between noise and double 
distance on toll connections. The toll connecting trunks of this survey 
are of relatively short length, but a similar linear relationship is evident, 
as shown in Table XI. Not only is there the effect of longer trunks 
accumulating more noise, but it should be recalled from Section III 
that the longer trunks are predominantly on carrier facilities. Both 
parameters, the length and facility type, contribute their effect to the 
noise level observed. The flat-weighted noise was independent of trunk 
length. This is still consistent with the past studies of intertoll trunks 
which showed only a very slight increase in flat noise with length for 
the shorter trunks. 


5.2 Background Noise on Compandored Trunks 


Since the syllabic compandor action on carrier facilities introduces 
a large loss to the noise in the absence of speech, a background noise 
reading with a quiet termination is not characteristic of a compandored 
channel in normal use (i.e., with speech present). In order to measure 
C-message weighted noise in a simulated speech condition, a holding. 
tone was sent over the trunk in order to adjust the compandor gains, 
and then it was filtered out at the receiving end. 

Measurements were made at four different levels of holding tone, 
in both directions of transmission on the trunk. The 1850-Hz tone 
was supplied by a Western Electric KS-19260 Pushbutton Oscillator, 
at levels of 0, —10, —20, and —30 dBm. After the tone was filtered 
out, the C-message weighted noise readings were made. 

Among the compandored toll connecting trunks there were those 
using Tl carrier, a digital system."’ This system uses instantaneous 
compandors which are active whether speech is present or not. The 
1850-Hz holding tone introduced distortion products into the measured 
signal as the fundamental and harmonies of the 1850-Hz tone interacted 
with the sampling frequency of the T1 system. Contrary to analog 
systems, the noise on T1 is mostly quantizing noise introduced by the 
encoder in the terminal, there being relatively small amounts of line 
noise. For these reasons, the T1 data are treated separately from the 
analog data. 

Table XII shows estimates of the mean, its 90-percent confidence 
interval, and the standard deviation of the C-message weighted noise 
as a function of input holding tone power level. A majority of the analog 
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compandored trunks were on N1 carrier, and their characteristics are 
also listed separately. For the analog carrier systems the noise com- 
ponents in the terminal equipment are low compared to the line noise. 
Hence, the measurements should reflect the expandor action at the 
near end where the measurements are made. Since a compression 
and expansion ratio of 2 : 1 is being used we would expect a 5-dB change 
in noise level for each 10-dB change in input signal level. For the analog 
facilities there is a ratio of about 10 : 7 between the 0- and —10-dBm 
level inputs, and a ratio of slightly less than 10 : 5 for the level inputs 
between —10 and —30 dBm. This compares to the theoretical 10 : 5 
ratio for perfect companding action. 


5.3 Impulse Noise 


Impulse noise was measured on a_ switch-through-switch basis, 
incoming to the class 5 offices. The measurements were made with the 
6F Voiceband Noise Measuring Set using C-message weighting. The 
four counters of the 6F Set were adjusted to cover the range of 4 to 45 
counts in the 15-minute recording interval. Then, on each trunk meas- 
ured, the recorded impulse counts and associated threshold levels are 
used to interpolate the levels that would correspond to 45, 15, and 4.5 
counts per 15-minute interval. The difference in the two levels that 
yield 4.5 and 45 counts is defined as the slope (in dB per decade) of 
the peak amplitude distribution. (See Ref. 12 for a discussion of these 
parameters.) On compandored facilities an 1850-Hz tone at a level of 
—14 dBm was applied to the trunk in the toll office in order to simulate 
data transmission trunk conditions. At the end office the tone was 
suppressed by a 90-dB band-reject filter with a 50-Hz bandwidth. 

The results presented in Table XIII are estimates of the mean and 
standard deviation of the 45- and 15-count level distributions, with a 
90-percent confidence interval on the mean estimate. The same esti- 
mates are shown for the slope. These data all apply at the switch at 
the receive end of the trunk. The compandored carrier trunks have a 
higher mean impulse noise level, but lower average slope, than the 
voice-frequency cable facilities. A lower slope indicates a narrower 
distribution of peak amplitudes, so that a smaller change in the reference 
count level is required to produce a change by a factor of ten in the 
impulse count. Figure 5 illustrates these points. With the exception 
of the estimates for all toll connecting trunks, the data of Table XIII 
and Fig. 5 result from trunks of less than 60 miles length. There were 
13 trunks longer than 60 miles, all of them on compandored carrier 
facilities. 
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Fig. 5—Average impulse noise levels at receive switch. 


VI. RELATIVE ENVELOPE DELAY 


In the transmission of complex waveforms a nonlinear phase shift 
characteristic of the transmission medium results in the distortion of 
the signal due to the differing amounts of delay of the different fre- 
quencies present in the waveform. This delay distortion can be assessed 
by measuring the relative envelope delay, which is the delay of the 
envelope of a low-frequency amplitude-modulated voice-freqeuncy 
carrier relative to the envelope delay at a reference voice-frequency 
carrier. The relative envelope delay is an approximation to the derivative 
of the phase characteristic, so linear phase would yield constant en- 
velope delay. In this survey all the envelope delay measurements are 
relative to 1700 Hz. 

The measurements of relative envelope delay were made using the 
25A Gain and Delay Measuring Set and KS-19260 Oscillator. A pair 
of these was required at each end of the toll connecting trunk, so as 
to measure both directions of transmission. 

The relative envelope delay data were analyzed both for toll con- 
necting trunks in general, and for various subclasses of interest. The 
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results show the estimated mean and its associated 90-percent confidence 
interval, and an estimate of the standard deviation of each distribu- 
tion considered. Figure 6 presents the relative envelope delay char- 
acteristics based on the sample of all toll connecting trunks. This 
reflects the differing influences of different types of facilities. For 
example, the nonloaded voice-frequency cable facilities have essentially 
no relative delay at the higher frequencies, so the relative delay derives 
from the carrier and loaded voice-frequency cable facilities. Also, in 
all cases the office equipment associated with the transmission facility 
is a major contributor to the delay at low frequencies.° 

The difference in the relative envelope delay between carrier and 
voice-frequency cable facilities is shown in Fig. 7. In general, the 
carrier facilities exhibit more relative envelope delay at the low and 
high frequency band-edges. The estimates for the carrier facilities are 
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Fig. 6—Mean relative envelope delay (includes all types of facilities). 
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Fig. 7—Mean relative envelope delay (carrier and voice-frequency cable facilities). 


only given up to 3200 Hz since excessive loss on the connection above 
this frequency precluded making the envelope delay measurements on 
the N1, ON, and O earrier facilities. 

Tables XIV and XV show the estimates of relative envelope delay 
for specific facilities. These subclasses were considered since a sufficient 
number of trunks of each type was measured, and the trunks were 
from a number of different offices. The seven types of facilities tabulated 
represent the facilities more commonly used for toll connecting trunks 
and they comprised about 85 percent of the total sample for this survey. 
Among the carrier facilities the delay on T1 showed the least variability. 
The delay on N1 has been reported in the past” where N1 was being 
used as an intertoll trunk, and those findings are comparable to the 
values reported here, with N1 now used as a toll connecting trunk. 
Although all the voice-frequency cable facilities exhibit similar delay 
characteristics at low frequencies, there is a marked difference between 
loaded and nonloaded facilities at the higher frequencies. This accounts 
for the higher variances of the VF facilities in the higher frequencies, 
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with the loaded VF facilities contributing most of this variance. The 
estimates of the variance for VF trunks with E6 or £23 repeaters are 
larger than other categories due to the correlation between delay and 
trunk length—the longer trunks exhibiting more delay. For example, 
the E23 measurements came from eight different offices, of which four 
offices supplied shorter trunks (5.3, 5.7, 6.2, 6.7 miles) and four offices 
had longer trunks (10.2, 11.7, 12.8, 16.3 miles). 

In considering toll connecting trunks in general, the correlation 
between trunk length and relative envelope delay is neither significant 
nor consistent. This is due to having many different facilities in the 
total sample and also to measuring the combined effects of office equip- 
ment and facility for each toll connecting trunk. Thus, there is only a 
slightly discernible relation between delay and trunk length at low 
frequencies (below 1700 Hz), since office equipment causes the pre- 
dominant effect. Above 1700 Hz the “VF Loaded’”’ facilities exhibited 
a significant length correlation. These facilities were for trunks of 
roughly 1 to 18 miles in length. A subclass analysis based on three 
mileage bands is shown in Fig. 8. The 90-percent confidence interval for 
each estimated mean value of delay (above 1700 Hz) is also shown. 


vil. P/AR 


The P/AR (Peak-to-Average Ratio) meter’® measures the ratio of 
the peak and full-wave rectified average values of a low duty-cycle 
pulse train transmitted over a transmission path. Its purpose is to 
provide a measure of the overall transmission distortion that is present 
from many different causes. The P/AR meter is most sensitive to 
envelope delay distortion and is also affected by noise, bandwidth 
reduction, gain ripples, nonlinearities such as compression and clipping, 
and other impairments. The P/AR reading is a indication of the gen- 
eral transmission quality of the voiceband channel. In particular, it 
is a measure of the phase linearity of the toll connecting trunk. If 
the P/AR signal were received entirely undistorted the P/AR meter 
would read 100. Distortion normally causes readings lower than 100. 

Four P/AR meter readings were made on each toll connecting trunk 
in the survey. Both directions of transmission were measured, using 
both polarities of the nonsymmetrical P/AR test signal. However, 
since there was no significant difference between readings using the 
normal and inverted polarity test signal, the average of the two was 
used in deriving estimates. P/AR generators now in production have 
eliminated this effect by inverting alternate signals. Table XVI shows 
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TRUNK LENGTH VS MILEAGE 


MILEAGE INTERVALS 
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Fig. 8—Relative envelope delay (VF loaded facilities only). 


the estimated values for the P/AR distributions of all toll connecting 
trunks and a variety of other subclasses based on types of facility. 

It should be noted that different facilities have different distributions 
of P/AR readings, so that the same numerical value may indicate a 
lower than expected reading on one facility but be a higher than ex- 
pected reading for a different facility. Hence, a comparison of subjective 
quality between trunks should be made only within a particular facility 
category, and not between categories. In general, comparing P/AR 
readings indicates the relative amount of intersymbol interference to 
be expected in each case. 

Since the P/AR meter reading is affected by envelope delay and 
frequency response it is not surprising that there is a relationship 
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between these parameters and the P/AR readings. In fact, there is a 
high degree of correlation between these characteristics, which implies 
that a trunk’s relative performance in any of the three measures is a 
good indication of its general transmission quality with relation to 
other trunks of the same type. 

To show the relationship between P/AR, envelope delay distortion, 
and loss, consider the performance of the more commonly used facilities 
with respect to the three mentioned parameters. Besides the mean 
P/AR reading for a category, the mean loss relative to 1000 Hz (dB) 
and the relative envelope delay (us) at 3000 Hz will be used. These 
values for the major identifiable categories are shown in Table XVII. 

Since P/AR, loss, and envelope delay have different underlying 
scales of measurement and unknown distributions, one method of 
measuring their relationship is by a rank correlation test.'* This measures 
the agreement between the performance with respect to the three 
parameters. The seven categories of Table XVII can be ranked,* as 
follows: 


Delay P/AR Loss 
vF 2° load l 1 1 
no rptr 
VIF no rptr 2 2 3 
Tl 3 4 2 
N2 4 3 4 
VF loaded 5 5 5 
VF repeatered 6 6 6 
Nl 7 L 7 


This yields a coefficient of concordance’ of 0.96, which is significant 
beyond the 0.01 point (if one feels that such small probabilities are 
significant). There evidently is a very high degree of correlation between 
these three transmission characteristics. In other words, if for a given 
facility one trunk exhibits a ‘“‘better’” P/AR reading than another trunk, 
it is also likely that it will have ‘‘better’’ loss and delay characteristics. 


VIII. LEVEL TRACKING ON COMPANDORED FACILITIES 


Of the 392 toll connecting trunks in the survey, 189 were on carrier 
facilities and these were compandored. These 139 trunks terminated 
in a total of 40 end offices. 


* The number one (1) indicates lowest loss or delay, or highest P/AR reading. 
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Compandor tracking was measured by transmitting an 1850-Hz tone 
at each of four power levels (0, —10, —20, and —30 dBm) and then 
measuring the received level. Measurements were made in both direc- 
tions of transmission, using a standard oscillator and the 25A Trans- 
mission Measuring Set (or 3A Noise Measuring Set for the —30-dBm 
level). 

The channel net gain for any given input power level can be taken 
as the reference level, and the net gain for other input levels is then 
referred relative to the reference level. For example, Table XVIII 
shows the deviations from channel net gain, relative to a —10-dBm 
input, at the other input levels. So at each input power, the variable 
of interest is the net gain of that power level minus the net gain at 
—10 dBm. Estimated mean deviations relative to any other power 
level can also be obtained from Table XVIII. 


IX. HARMONIC DISTORTION 


The nonlinearities of the transmission path for the toll connecting 
trunks caused harmonic distortion of the signals being carried. The 
amount of harmonic distortion was determined by sending a 1000-Hz 
tone at each of four power levels (0, —10, —20, and —30 dBm) and 
measuring the power level of the received fundamental tone and the 
second and third harmonics. The fundamental was measured with a 
25A Set if above —25 dBm. Below this level, a 3A Set with C-message 
weighting was used. The second and third harmonic measurements 
used the 3A Set with a narrow bandpass filter, centered at 2 or 3 kHz, 
in place of the C-message weighting network. The measurements were 
adjusted to reflect the loss of the bandpass filters. There was also a 
“measurement floor’ to consider due to the sensitivity of the measuring 
equipment. This was determined to be 48 dB (i.e., a fundamental-to- 
harmonic ratio of 48 dB) so that harmonics more than 48 dB below the 
1000-Hz fundamental were measured with increasing inaccuracy as 
the level decreased. Measurements were made in both directions of 
transmission. 

The estimates of harmonic distortion are shown in Table XIX for 
all the toll connecting trunks measured in the survey. The results are 
presented as signal-to-harmonic distortion power ratios (8/D) showing 
means, confidence intervals, and standard deviations. An analysis of 
the data showed that the larger values of distortion (lower S/D ratios) 
almost always occurred on compandored systems (i.e., all the carrier 
facilities). The harmonic distortion on the noncompandored trunks 
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(i.e., voice-frequency cable facilities) was usually below the ‘‘measure- 
ment floor,’ resulting in only a few accurate measurements on these 
facilities. Hence, they were not analyzed in detail. The results for 
compandored toll connecting trunks (carrier facilities) are shown in 
Table XX. Among these carrier facilities a subclass analysis of the more 
commonly used facilities is shown in Table X XI. The estimates there 
are for the signal-to-distortion ratio, 1.e., the total harmonic distortion 
below the fundamental. 

In studying Table XIX and XX one notes that the average distortion 
power is roughly the same for the second and third harmonics. However, 
the second and third harmonic distortion power distribution are dis- 
similar, as can be seen from the plot of their distributions in Fig. 9. 
It shows clearly that for the high distortion levels, a greater percentage 
of the trunks have second harmonic power exceeding a chosen level 
than third harmonic power. About 20 percent of compandored facilities 
have a fundamental-to-second harmonic distortion ratio less than 30 dB, 
while only one percent have a fundamental-to-third ratio less than 
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Fig. 9—Second and third harmonic levels in dB below first harmonic (received 
fundamental) for compandored systems—end and toll office data. Input level of 
fundamental = —10 dBm. 
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30 dB. Hence, when the total harmonic distortion is relatively large 
it is usually due to the second harmonic component. This feature is 
also evident in a distribution plot of the second and third harmonies 
for all toll connecting trunks, and implies a positive skewness’’’’® of 
the population of third harmonic measurements. 


xX. CONCLUDING REMARKS 


The 1966 survey of toll connecting trunks has supplied information 
on the transmission characteristics of a key part of the built-up con- 
nection between subscribers. In addition, when combined with other 
such data, this information has been useful in generating a transmission 
model of the Bell System network. 

Although carried out as a trunk survey, it was possible through 
subclass analysis to derive estimates of the transmission performance 
of toll connecting trunks using specific facilities. These included trunks 
using voice-frequency cable pairs, loaded and nonloaded, and those 
using voice-frequency repeaters, as well as the more commonly used 
short-haul carrier systems, N1 and Tl. These trunk data reflect the 
effects of office equipment as well as those of the facilities. 

Another use of the 1966 survey data has been in making comparisons 
with data from the 1964 Intertoll Trunk Survey.’® Certain subclasses 
of each survey are similar enough to warrant investigation. For example, 
in both cases the trunks of 0 to 15 miles trunk length use voice-frequency 
facilities for about 80 percent of the trunks. Since their makeup is so 
similar their respective estimates for loss, noise, and delay can be 
compared. The agreement of these estimates is very good, and this 
lends mutual confidence to both estimates. The same is true if we con- 
sider the collection of trunks 15 to 30 miles long where N1 carrier 
was the predominantly used facility. 

It should be remembered that although these results accurately 
picture the transmission characteristics of toll connecting trunks in 
1966, with the passage of time changes can occur in the sampled popula- 
tion. Introduction of new facilities and the uneven growth of others 
are two obvious occurrences which create the need for continual updating 
of transmission data. 
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TaBLE I[—EstTIMATED TRUNK LENGTH AND Facinuiry TYPE IN LARGE 
AND SMALL OFFICES 














Trunk Length 
(Miles) Type of Facility (%) 
; Voice 
Office Size Mean + 90% C.I. Carrier Frequency 
Small (<400,000 AOTM*) 
Stratum 1 25.2 + 11.2 63 37 
Large (2400,000 AOTM) 
Stratum 2 6.74 2.4 19 81 











* AOTM: Annual Outgoing Toll Messages 


TasBLeE IJ—TruNKk Lenoers vs Faciniry Tyrr 








Percent of TCTs in Length Categories 
Facility Type 0-15 Miles 15.1-80 Miles Over 30 Miles 
N1 Carrier 5.8 42.0 27 .0 
T1 Carrier 10.2 4.6 — 
Other Carrier* 1.0 32.1 73.0 
Voice-Frequency Cable 83 .0 21.3 — 
100.0 100.0 100.0 





* The facilities in this category consist of ON, ON/R, O, N2, and Lenkurt carrier 
types. 


Taste I[[[—lacitiry Composition or ToLtu CONNECTING TRUNKS 





Percent of all TCTs 





Facility Type Mean + 90% C.I. 
1. Carrier :* 34 
Nl 13 + 8 
Tl 8+8 
Othert 10 
2. Voice-Frequency Cable: 66 
Nonloaded 23 + 11 
Loaded: 43 + 13 
Loaded, nonrepeatered 645 
Loaded, E6 repeater 26 + 10 
Loaded, E23 repeater 74+6 


* 3% of these are carrier facilities in tandem with a section of loaded voice-fre- 


quency cable. . 
t The facilities in this category consist of ON, ON/R, O, N2, and Lenkurt carrier 
types in quantities individually too small for reliable estimates. 
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TaBLE ITV—Swircuinc Usep sy ToLu CONNECTING TRUNKS OF 
SURVEY 





 §Xxs 120 


5XB 20 
SWBD 10 
XBT _ 
4A = 

Total 150 





Stratum 1 (<400,000 AOTM*)—15 Small End Offices 


12 
2 


| | 





Stratum 2 (2400,000 AOTM)—25 Large End Offices 


At End Office 


SxS 91 
5XB 115 
XBT — 
1XB 25 
SWBD _ 
4A — 
PANEL 11 
Total 242 
Grand 
Totals 392 
Notes: SXS: 
5XB: 


LXB: 
SWBD: 
4A: 
XBT: 


Type Trunks Machines 





Switchboard 

No. 5 Crossbar 
No. 1 Crossbar 
Switchboard 

4A Crossbar 
Crossbar Tandem 


At Toll Office 
Trunks | Machines | Trunks 


36 
11 
119 


Type Trunks | Machines 

















5 127 
2 126 
16 119 
2 32 
13 44 
6 25 
a 11 
44 484 
65 784 








Trunks | Machines| Trunks Machines 
4 154 16 
6 68 8 
5 34 6 
4 32 4 
2 12 2 
21 300 36 


Total Both Ends 


Machines 


14 
15 
16 

6 
13 





* AOTM: Annual Outgoing 


Toll Messages 


TABLE V—IREQUENCY RESPONSE OF TOLL CONNECTING TRUNKS 





Switch-to-Switch Loss (dB) 
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TasLe VI—Loss RELATIVE To 1000 Hz (dB) ror CARRIER FACILITIES 
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Tas_LE VII—Loss RELATIVE To 1000 Hz (dB) ror VotcE-FREQUENCY CABLE 
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TaBLE VIII—3A Noise on Totut CONNECTING TRUNKS 














C-Message Weighting 3-kHz Flat Weighting 
Mean 90% C.1. Std. Dev. Mean 90% C.I. Std. Dev. 
Facility Type (dBrn) (dB) (dB) (dBrn) (dB) (dB) 
Carrier 
Ni 22 +0.9 4.8 34.7 +2 .3 7.6 
N2 17.5 — 8.1 38.8 — 7.9 
Tl 16.0 — 3.8 31.9 — 3.8 
All Carrier 18.4 +1.7 6.4 33.3 +1.8 8.0 
Voice-Frequency Cable 
Nonloaded, nonrepeatered 5.5 +1.4 4.4 33.5 +3.1 14.3 
Loaded, nonrepeatered 9.5 +4 .2 7.8 37 .0 +3 .2 13.2 
Loaded, E6 repeater 11.3 +2.1 7.3 35.9 +3.3 11.1 
Loaded, E23 repeater 13.2 +2 .8 6.9 39.4 +4.4 11.4 
All VF 9.5 +1.5 Tz 35.7 +1.9 12.7 
All TCTs 12.5 +1.4 8.2 34.5 +1.5 11.4 
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Taste IX—3A Noise wits 3-KHz Itar WEIGHTING 
(6 = End Office Noise—Toll Office Noise) 








Mean 6 90% C.I. Std. Dev. 
Facility Type (dB) (dB) (dB) 
Carrier 
Nl 6.1 +2.9 9 .2 
N2 9.0 — 10.9 
Tl 3.5 — 10.1 
All Carrier 6.0 +2.7 9.9 
Votce-Frequency Cable 
Nonloaded, nonre- 
peatered 18.7 +3 .4 10.8 
Loaded, nonrepeatered 19.5 +1.9 4.6 
Loaded, E6 repeater 12.0 +2.9 9.2 
Loaded, E23 repeater 13.9 +7.8 12.4 
All VF 15.2 +2.3 10.4 
All TCTs 12.1 +2.2 11.1 





TABLE X—3A NOISE wItH 3-KHz FLAT WEIGHTING 


Mean 90% C.I. Std. Dev. 
Survey (dBrn) (dB) (dB) 
1964 ITT Survey 
(0-62 .5 miles) 29.4 +2.0 9.0 
1966 TCT Survey 
(toll offices) 28.7 +1.4 9.0 


TaBLE XI—3A Noise WitH C-MEssaGE WEIGHTING: 
TrunNK LENGTH ANALYSIS 














Mileage Mean 90% C.I. Std. Dev. 
Category (dBrnC) (dB) (dB) 
0.0—- 2.0 5.5 +1.3 4.3 
2.1- 4.0 9.5 +0.8 7.9 
4.1- 8.0 12.3 +1.9 7.3 
8 .1-16.0 15.6 +2 .0 6.6 
16 .1-382 .0 17.3 +3.0 6.8 
Over 32.0 19.0 +2.2 7.1 











TaBLE XII—C-Messace Noise as A Function or Input Houpinc TonE PowrER 


Received Noise Level (dBrnC) 








Analog Systems Tl Carrier 
(Syllabic Compandor) N1 Carrier (Instantaneous Compandor) 
Transmit Level © |) Sl  — 
(dBm) Mean + 90% C.I. Std. Dev. Mean + 90% C.I. Std. Dev. Mean + 90%* Std. Dev. 
0 39.2 41.8 6.3 41.9+2.1 6.4 46.2 + 2.0 
—10 32.2 + 2.0 6.5 35.1 + 2.2 6.5 39.9 + 2.1 
—20 27.4 + 2.0 6.4 30.5 + 1.9 6.0 30.8 + 2.1 
—30 22.2+42.1 6.3 25.7 41.7 5.2 21.24 2.0 





* Confidence interval not shown because of small sample size. 
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TABLE XIJI—ImputsE Count LEVEL DISTRIBUTIONS 


45-Count Level (dBrnC) 15-Count Level (dBrnC) Slope (dB/decade) 


Facility Type Mean + 90% C.I. Std. Dev. | Mean + 90% C.I. Std. Dev. | Mean + 90% C.I. Std. Dev. 


N1 Carrier 62. 


84.7 8.4 65.9 + 4.7 8.4 5.8 + 1.0 2.1 
T1 Carrier 57.8 + 4.3 7.6 58.7 + 4.3 7.5 2.7 +0.2 1.8 
VF2W: 
NL, NR 43.8 + 4.3 10.1 49.5 + 4.6 11.0 10.8 +1.3 3.8 
L, NR 44.1+ 4.2 10.2 54.1 + 4.8 10.9 8.5 + 2.6 4.3 
L, E6 49.38 +1.9 7.0 52.4 + 2.1 7.2 5.6 + 0.6 2.0 
L, E23 46.8 +1.5 6.7 50.4 41.3 6.8 7.1+0.9 3.4 
All VF* 46.0 + 1.9 8.7 50.5 + 1.9 9.1 8.2+ 0.7 3.9 
All TCTs 49.4+ 2.0 10.2 53.8 +1.9 10.0 7.3+0.8 3.8 


Notes: Measurement interval 15 minutes 
L = loaded cable 
NL = nonloaded cable 
NR = nonrepeatered cable 
E6, E23 = E6, £23 repeaters 
* Includes 2-wire and 4-wire 
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Frequency 
Hz) 


200 

300 

400 

600 

800 
1000 
1200 
1400 
1700 
2000 
2300 
2500 
2700 
3000 
3200 
3300 
3400 


TABLE XIV—RELATIVE ENVELOPE DELAY OF CARRIER FACILITIES* 


Std. Dev. 





* Includes office equipment. 


Mean + 90% Cl. 
3390 + 155 391 
2055 + 88 232 
11384 + 64 140 
416+ 36 75 
175 + 16 47 
63 + 16 47 
27+ 5 22 
6+ 5 19 
0 ms 
17+ 7 18 
82+ 6 24 
159 + 13 40 
2424+ 13 49 
615+ 19 91 
698 + 23 99 




















Delay (us) 
Tl 
Mean + 90% C.I. Std. Dev. | Mean + 90% C.I. Std. Dev. 

1602 + 126 271 1707 + 70 269 
1245 + 144 157 1113 + 66 182 
773 + 101 115 683 + 46 100 
324 + 43 60 269 + 12 36 
1385 + 13 47 117+ 6 20 
102 + 27 69 50+ 5 18 
69 + 34 86 15+ 3 18 
17 + 10 28 O+ 3 12 

0 — 0 — 

41 + 24 89 9+ 1 7 
30 + 10 45 383 + 2 10 
404+ 7 12 544 8 19 
108 + 12 76 92+ 3 9 
170+ 9 26 169 + 4 15 
201 + 17 21 2124+ 2 18 
340 + 15 53 238 + 3 18 
429+ 5 57 271+ 3 16 
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Frequency 
Hz 


200 

300 

400 

600 

800 
1000 
1200 
1400 
1700 
2000 
2300 
2500 
2700 
3000 
3200 
3300 
3400 





TABLE XV—RELATIVE ENVELOPE DELAY OF VoICE-FREQUENCY CABLE FACcILITIES* 


























Delay (us) 
VF2W eee VF2W | Loaded 
Nonrepeatered VF2W-E6 Repeater VF2W-E23 Repeater \ Nonrepeatered 
Mean + 90% Mean + 90% Mean + 90% Mean + 90% 
C.1. | Std. Dev. C.I. | Std. Dev. C.I. | Std. Dev. C.1. | Std. Dev. 

604 + 60 202 679 + 227 505 973 + 522 700 872 + 139 280 
442 + 48 159 559 + 105 304 587 + 271 400 570 + 87 219 
251 + 30 92 297 + 54 186 . 305 + 131 217 299 + 47 121 
102 + 14 39 64+ 29 83 424+ 55 87 96 + 17 47 
50 + 7 21 —4+ 26 55 5+ 28 76 274+ ii 25 
284 4 12 —23 + 21 45 —20 4 24 38 5+ 10 20 
13+ 2 9 —29 4+ 15 34 —41 + 13 47 —-ll+ 9 18 
6+ 1 fi —224+ 9 24 —29 + 24 57 -10+ 8 17 
0 — 0 — 0 — 0 — 
— 6+ 2 é 46+ 16 39 57 + 26 63 21+ 13 20 
—-10+ 3 9 116+ 36 76 142+ 80 109 36 + 17 26 
—ll+ 4 11 177 + 56 113 2124+ 44 84 70 + 27 45 
—-13 4+ 4 12 255 + 75 156 299 + 122 181 121+ 48 68 
—-13 + 5 13 501 + 148 310 576 + 188 265 217 + 68 103 
— 94 5 13 772 + 229 470 899 + 347 464. 320 + 126 167 
- 82 5 13 957 + 309 627 1097 + 440 577 341 + 155 224 
—9+ 5 13 1018 + 363 711 1275 + 601 802 398 + 266 327 





* Includes office equipment. 
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TABLE XVI—P/AR_ EstIMATES 


Category Mean + 90% C.I. Std. Dev. 
1. All toll connecting trunks 93.6 + 1.3 7.4 
2. All carrier facilities: 90.9 + 2.2 6.9 
N1 carrier 88.8 + 1.7 6.9 
N2 carrier 96.2 + 2.6 6.4 
T1 carrier 96.1 + 0.7 4.7 
3. All voice-frequency cable: 95.0 +1.7 7.1 
VF-nonloaded 98.6 + 0.6 2.6 
VF-loaded 93.1 + 2.5 8.0 
VF2W-E23 repeater 92.44 3.8 6.2 
VF2W-E6 repeater 92.34 3.8 9.2 
VF2W-loaded, nonrepeatered 97.0 +1.8 2.8 
VF4W 92.5 4.0 


t 


TABLE X VII—EstimatTED MEANS OF RELATED TRANSMISSION 


PARAMETERS 
Relative Loss* Relative 
Mean Envelope To 1000 Hz 
Facility Type P/AR Delay (us) (dB) 
VF: no load, no repeater 98 .2 —13 0.4 
VF: no repeater 98 .2 36 2.3 
VF: loaded 93.1 475 3.6 
VF: repeatered 92.5 522 3.8 
Carrier: N1 88.8 615 4.8 
N2 96 .2 170 2:7 
Tl 96.1 169 1.9 


* At 3000 Hz. 


TasBLeE XVIII—Companpor LEVEL TRACKING 
(Estimated Deviations from Net Gain at —10-dBm Input) 








Input Power (dBm) Mean + 90% C.I. (dB) Std. Dev. (dB) 
0 —0.3+0.1 0.5 — 
—10 0 — 
—20 0.7+0.1 0.5 


—30 1.2+0.3 1.2 


TABLE XIX—SratTistics For THE Ratio oF FUNDAMENTAL TO SEcOND, THIRD, oR Toran Harmonic PowER 
IN dB (Att Totti ConnEctiNnG TRUNKS) 





Transmitted Level (dBm) 
0 —10 —20 —30 

Mean + 90% Std. Mean + 90% Std. Mean + ne Std. Mean + 90% Std. 

‘ eke 4 li 











Harmonic C.J Dev Dev. Dev Dev 
Second 47.1 +2.4 9.3 48.142.8 9.3 48.9 + 3.0 9.6 48.0 +2.9 9.4 
Third 46.6 + 2.0 9.0 47.9+1.8 7.9 49.0 + 2.0 8.2 47. 242.0 8.4 
Total 43.342.3 8.9 44.342.5 8.6 45.3 +2.8 9.0 43.9+2.7 9.1 
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TABLE X X—STATISTICS FOR THE RATIO OF FUNDAMENTAL TO SECOND, THIRD, OR ToTAL HARMONIC POWER IN 
dB (CARRIER IacILitiEs) 





Transmitted Level (dBm) 




















0 | —10 —20 —30 
Mean + 90% Std. Mean + 90% Std. Mean + 90% Std. Mean + 90% Std. 
Harmonic C.I. Dev. C.I. Dev. C.I. Dev. C.I. Dev 
Second 36.0 + 1.2 6.5 37.0 + 2.9 7.3 37.9 + 3.9 8.2 38.9 + 3.1 7.2 
Third 35.6 + 2.1 6.0 38.2 + 0.9 5.1 39.4+1.5 5.6 39.241.3 5.1 
Total 32.0 +1.3 5.2 33.5 + 2.0 5.5 34.4 + 3.0 6.6 34.9 + 2.4 5.8 
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TasBLE XXIJI—SiGNAL-ro-DisrorTION Ratio FOR CARRIER FacrLirres (IN dB) 





Carrier Facility 














Ni.” N2 ON * TI 
Transmitted | Mean + 90% Std. Mean + 90% Std. Mean + 90% Std. Mean + 90% 
Level (dBm) C.L. Dev C.I. Dev. C.I. Dev C.1. 
0 28.0 + 0.5 3.3 38.0 +1.0 4.2 82.5 +1.0 2.8 34.1 + 0.2 
—10 33.6 + 0.6 4.0 38.9 +1.6 3.2 35.14 1.2 3.0 29.3+40.5 
—20 36.7 + 0.6 3.9 38.2 +0.8 4.7 35.6 +1.5 3.2 28.0 + 0.4 
—30 37.2 + 0.6 3.5 37.0 +1.1 4.6 36.5 + 0.6 3.2 30.0 + 0.5 











Std. 
Dev. 


Rares 
On 
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An Adaptive Echo Canceller in a 
Nonideal Environment 
(Nonlinear or Time Variant) 


By E. J. THOMAS 
(Manuscript received October 19, 1970) 


In this paper we calculate a lower bound on suppression provided by an 
adaptive echo canceller in either a nonlinear or time variant environment. 
Specifically, we examine the effects on performance of nonlinear echo paths 
described by a Volterra integral equation [equation (18)] or time variant 
echo paths caused by phase jitter on single-sideband suppressed carrier 
systems. 


I, INTRODUCTION 


In a previous paper? we have described the performance of an 
adaptive echo canceller in a linear time invariant environment. Qual- 
itatively speaking, the environment in which an echo canceller will 
operate appears to be mainly linear and time invariant; however, 
some echo paths will be nonlinear and/or time variable. Some typical 
causes of nonlinearity are the volume dependent gain in compandored 
circuits and harmonic distortion in amplifiers and repeaters. Time 
variability, on the other hand, may be caused by spurious modulation 
of the carrier of long-haul single-sideband suppressed carrier systems. 
This is commonly referred to as incidental FM or phase jitter. In cer- 
tain cases these anomalies are of sufficient magnitude to degrade the 
performance of an adaptive echo canceller. 

In this paper we examine the operation of an adaptive echo can- 
celler studied previously in an ideal environment? in a nonideal 
environment (nonlinear, time variant). We restrict ourselves to non- 
linearities which do not possess infinite memory and to time varia- 
bility caused by phase jitter. In both cases we derive a lower bound on 
the suppression provided by the echo canceller and empirically verify 


+ This problem occurs when the compressor portion of the compandor is not 
perfectly compensated for by the expandor. 
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our results. For the sake of brevity we restrict our discussion to a 
digital implementation of the type shown in Fig. 1. However, one can 
investigate other implementations of the echo canceller and obtain 
similar results. 


II. GENERAL CONSIDERATIONS AND NOTATIONAL FORMS 


In Fig. 1 we show how an echo canceller would be connected in a 
typical connection. The input signal x(t) produces an echo y(t) cor- 
rupted by noise, ¢(¢). An approximation of y(t), ya(t), is subtracted 
from the actual echo and noise producing a cancelled echo e(t). Ex- 
amining the echo canceller in a little more detail we find that it is 
composed of M digital filters having the set of orthonormal impulse 
responses {A;(kK)}, and the set of outputs {w;(k)} to input x(k).t 
Every tap has associated with it the adaptive network shown for two 
taps in the figure. From Fig. 1 we conclude that the gain of the 7th 
tap at the k + 1 sampling interval is given by 


gi(k +1) = g:(k) + [KI e(k)w,(k). (1) 
Let us assume that the response of the echo path to x(t) may be 
given in the following form: 


y(t) = eZ) * h(t) + ¥@). (2) 
where h(t) is any square integrable function, (*) denotes convolution, 
and y(t)’ is any function which is required to make equation (2) correct. 
Qualitatively speaking we see from equation (2) that we are breaking 
y(t) into a linear component x(t) * h(t) and a distortion term y(t). In 
subsequent sections of this paper we will demonstrate that the nonlinear 
or time variant echo paths that we choose to study yield y(t) ’s of the 
form of equation (2). We will therefore first examine the effect of a 
system described by equation (2) on the performance of an adaptive 
echo canceller. We will then calculate y(t) for a nonlinear or a time 
variant echo path. 


2.1 The Governing Equation 


Since we hypothesized that A(t) was a square integrable function we 
may represent it by a generalized Fourier series 


no = Dero 


ta(k) denotes the value of x(t) at the kth sampling interval; also, all sam- 
pling is assumed to be at the Nyquist rate. ; ; 
+ y(t) may depend on the absolute time when x(¢) is applied. 
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x(t) TO ECHO PATH 
Piss le Maan NS 
FAR END NEAR END 

x(k) 
A, (kK) dw (h) 
W, (k) 





) FROM ECHO PATH 
e(t)=y(t) -ya(t) + &(t) Fy S(t) 


e(t)=e'(t) + §(t) ECHO + NOISE 
CANCELLED ECHO + NOISE 


Fig. 1—An adaptive echo canceller. 


where 
Ge [ ” AD dC) de 
Substituting 55%, C;);(é) for h(t) in equation (2) we obtain 
yl) = Lo Cavalt) + HO 


where w,(t) = x(t) * \,(é). 
From Fig. 1 we see that 


ya(t) = 2d gi(tw(t). 


Therefore, 
M -) 


ef) = Deru) + 2 Cw) + vO + (3) 


t=1 
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where 
r(t) = C; — g(t). (4) 


Combining equations (1) and (4) we obtain 


r(k + 1) = ri(k) — | K | w,(k)e(k). 
Squaring the above equation, summing over the M taps of the echo 
canceller, and using equation (3) we obtain 


Dy ie + 1) — i) = —2 | K | s r(k)w, Ge r(h)w,(h) 


+ a Cyw,(k) + vk) + @) 


M 


1K FY who( rane) 


+ > Cw + vO+i@)- © 


In order to simplify our notation we make the following three 
definitions: 


a(t) = D> Crw.lh) + YO), (6a) 
wa) = Duta, (6b) 


» 
v(k) = a r(k)w,(k). (6c) 


As a result we obtain 


Do (ie + 1) — rif) 


= —2|K | @(k) + a(k)o(k) + ¢(A)o(h) 
+ | K P w(k){o'(k) + 0(k) + PR) + 20(K)o(k) 
+ 2¢(k)u(k) + 25(k)e(k)}. (7) 
The above equation describes the performance of the adaptive echo 
canceller shown in Fig. 1 in the environment described by equation (2). 


However, before we can apply equation (7) to the problem at hand . 
one final definition is required. We define the average value of a sam- 
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pled function f(k) in the interval Qn + 1k S (p+ 1)Qas 


(p+1)Q 


io=% dX 1® p=0,1,2 


k=Qptl1 


(8) 


~ 


where Q is the number of samples per interval and is constant. Therefore, 
{(p) may be interpreted as the average value of f(k) averaged over the 
pth interval containing Q samples. Note that for all p the p — 1 and 
p + 1 intervals are adjacent to and do not overlap the pth interval. 


2.2 Choices of | K | Which Allow Best Match to the Echo Path Fourier 
Coefficients 


Let us now apply our averaging technique defined by equation (8) 
to equation (7). We will assume that the circuit noise ¢(é) is zero mean, 
with variance o7. Also, we will assume that the noise is statistically 
independent of the other variables in equation (7). This is justified by 
virtue of the fact that we are assuming that K is small.’ Furthermore, 
we will assume that the number of samples in the pth interval, Q, is 
large enough so that ¢(p) and ¢*(p) are good estimates of the true mean 
and variance of the circuit noise. As a result of the above we obtain 


1/0 Yo fr i(Qp + 1) + Q] — r:(Qp + I} 
= —2|K |v%(p) — 2| K | 6@ 
+ 2|K |? a(po(p)w(p) 
+ | KP wow) + | K P o’@) 
+) KP (pre). (9) 


From (9) it is clear that the sum of the r? is reduced when the right-hand 
side of the equation is negative.’ In fact, it is continually reduced until 
the right-hand side of (9) can no longer be negative. Therefore a suf- 
ficient condition for convergence? can be written as 


A(p) = | K |? [(o(p) + 6(p))w(p) + o2u°(p)] 
— 2|K | @(p) + 6(p)v(p)) < 0 (102) 


+ This may be more easily seen if Zi-r” 7:2 1s considered to be the magnitude 
squared of a vector whose components are {ri}. In this event the left-hand side 
of the equation is proportional to the difference of the magnitude square of a 
vector at some instant and Q samples earlier. When the right-hand side is 
negative it indicates that the magnitude of the vector is smaller than it was Q 
samples ago. 

In this paper convergence is taken to mean the reduction of the sum of the 
T12, 
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or equivalently 


A'(p) = | K | [(p) + O(p)u"(p) + o2u"(p))] 


— 20°(p) + O(p)u(p)) < 0. (10b) 
By defining 
Bp) = (V(p) + 0(p))U"(p) + 22U"(p), 
D(p) = (V"(p) + 4p) V(p), 
we find 


A(p) = | K |’ B@) — 2| K | D@). 


From the above equation it should be clear that 


A(p) $0 
for 


D(p) 
0<|[K|<2| 22 


and 


D(p) 2 0. 


Therefore we conclude that the echo canceller converges for D(p) > 
0 and 0 = |K| S 2D(p)/B(p). However, since | V(p)| is small in 
the neighborhood of equilibrium, D(p) decreases quicker than B(p) 
and as a result the upper bound on | K | is reduced while convergence 
is taking place. Convergence ceases when 


On the other hand for D(p) < 0 there is no positive choice of K 
which will allow convergence and therefore the canceller diverges until 
D(p) becomes positive and 0 < | K| S| 2D(p)/B(p)|. As a result the 
best choice of | K | which will allow the most convergence (the smallest 
| D(p)| and therefore the best match to the Fourier coefficients C;) is 
the smallest | K |. Ideally | K | > 0 will allow the best match to the 
echo path Fourier coefficients. However, the smaller | K | is made the 
longer it takes for the canceller to converge, and in practice a com- 
promise is required between amount and speed of convergence. 


2.3 A Lower Bound on Achievable Suppression 
We have shown previously (Section 2.2) that choosing | K| very 
small will allow the best match of the tap gains g;(k) to the Fourier co- 
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efficients C;. Taking the limit of equation (10b) as | K | goes to zero 
we obtain 
lim A’) = —20°@) + 6@)@) ¥ 0 


or convergence takes place as long as 


v'(p) > —A(p)v(p) (11a) 
and equilibrium is reached when 
v"(p) = —O@)v(p). (1b) 


A sufficient condition for (11) to be true is 


v°(p) = | e(p)(p) |. 


From the definition of [-] [equation (8)] and applying the Schwarz 
inequality, we obtain 


| e(p)o(p) | S | Ope) | S$ COVE). 


From the above two inequalities we see that a looser but still 
sufficient condition for (1la) and (11b) to be valid is 


v°(p) = ((p))*[6°(p) |} 
or 


v(p) = P(p). (12) 


Equation (12) is a sufficient condition to insure convergence in the pth 
interval. We therefore conclude that when | K | is made infinitesimally 


small the echo canceller will reduce >> ™, r°(p) at least until: 


v'(p) S #(p). (13) 
We will now use equations (13) and (lla) and (11b) to determine 
a lower bound on the achievable suppression. Referring to Fig. 1 we 
define the suppression S as 


S=101 HO. 14 
*B ey? oa 


From equations (3) and (6) it may be easily shown that 
e(k) = v({k) + Ok) 








or | 
ep)” = vp) + 2v(p) 0(p) + F(p). 
Substituting equation (11b) into the above equation we obtain 


'(p)” = v(p)6(p) + 6(p). 
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Also, since 
'(p))” S |v@)e@) | + #@), (15) 
we can apply Schwarz’s inequality to the above and obtain 
’(p))” S @'(p))*(@'(p))? + 6°(p). 
Substituting (13) and the above into (14) we obtain 








S = 10 log, xe S. (16) 


On the other hand, if for a given input signal 


v(p)4(p) = 0," 
we see from (15) and (14) that 


S = 10 log: ee) os = S,. (17) 


Equations (16) and (17) are lower bounds on the achievable 
suppression which occurs when | K | is chosen to be infinitesimal. For 
any other than infinitesimal | K | the suppression may or may not be 
less than the values given by the lower bound [ (16) and (17) ]. How- 
ever, as will be seen in a later section of this paper, for the choices of 
| K | which allow reasonable settling times, (16) and (17) do form 
lower bounds on suppression. 


III. NONLINEAR ECHO PATHS 


We will now become more specific and consider the effect of a non- 
linear echo path. We will restrict our discussion to a class of nonlinear 
echo paths which possess the following four properties:? 


(i) They are time invariant. 

(77) They are deterministic. 

(iat) They are “smooth.” Qualitatively speaking by smooth we mean 
that the echo path cannot introduce any abrupt or switch-like 
changes in the output. If such a change is evident in the output 
then it must be due to a similar switch-like change in the input. 

(iv) They possess noninfinite memory. That is the memory does not 
depend on the remote past. 

+ For example, this occurs when @ and v are independent and one is zero 

mean. As will be seen this is the case for systems exhibiting phase jitter. 


+ Most echo paths encountered in practice satisfy all the conditions except one 
and these are handled separately in Section IV. 
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Echo paths which possess the above four properties may be con- 
veniently characterized by a Volterra integral equation.?-* As a result, 
y(t) shown in Fig. 1 is related to x(t) by the equation given below: 


yd = [nda — 2) dr 
+ a iz ho(r, , T2)a(t — m)a(t — 72) dr, dre 


+P Pies 1) Tat = 0) dey (18) 


| 


We may easily place equation (18) in the form of equation (2): 


Hire) AO Se ee x ne) II 2 - Aas 


n 


Comparing the above equation with equation (2) we see that 


Wd = Po Pine 2a) Te = 2) dey 
| 


As a result we conclude that the bounds given by equations (16) 
and (17) are applicable to any nonlinear system possessing the prop- 
erties outlined in the beginning of this section. We need only replace 
6(k) by 


o 


a(k) = dy Cyw;(k) 


t=M 


+ fo fia ore) Te = 2) drs (19) 


In this case, (kK) is composed of that part of the linear portion of 
the echo path which the echo canceller cannot compensate for due to 
its limited number of taps and the nonlinear portion. 


IV. TIME VARIABILITY 


In this section we are specifically interested in time variability of 
the type predominantly found on single-sideband suppressed carrier 
systems and commonly referred to as incidental FM or phase jitter.t 


t We use incidental FM and phase jitter interchangeably. 


2788 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1971 


We will show that the effect of incidental FM on suppression can be 
calculated directly from the index of modulation, 8, which is easily 
measured. 


4.1 Carrier System Model-Assumptions 


In Fig. 2 we show a block diagram of a typical single-sideband 
suppressed carrier system whose output will be corrupted by phase 


m,(t)=cos [o,t+B¢ (t)] Mo(t)=Ccos wet 


x(t) he (t) b(t) Qe het FAS pu he (t) y(t) 


Fig 2—Single-sideband suppressed carrier system with phase jitter. 


jitter. We will begin by making the following practical assumptions: 
(t) The input signal x(t) is bandlimited to w,. That is, 
Xe) = ee for |w| <a,’ 
0 otherwise. 
(iz) The filter h(t) is a low-pass filter bandlimited to wz, 
Hale) = a for |w|<wg where w; >w, +4; 
| 0 otherwise. 
(iit) The incidental FM is narrow-band FM, 
| Bott) | <K 4/2 vt 
and 
o(t) = cos w,t wo; Kw,. 


(iv) The filter h,(¢) is an ideal bandpass filter, at carrier frequencies, 
which passes the upper sideband, 
< < , 
He) = | for Oo = lw | Sw, tw, +0; ,, > op 


0 otherwise. 


t Capital letters are reserved for functions of frequency, small letters for functions 
of time. 
«. x(t) has the Fourier Transform X(w). 
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(v) The filter hy () is a low-pass filter bandlimited to wz, 
Hy (w) as ey for | w | < Wp where Wp > wW, + Ww; 
0 otherwise. 


4.2 Derivation of the Relationship Between x(t) and y(t) 


Consider Fig. 3 for a moment. H,(w) is an ideal, not physically 
realizable, filter which passes only positive frequencies. That is, 


Ge ‘ 0 20 (20) 
0 w< 0. 


cos [w,t + Bo (t)] e~Jwct 


' t 


Fig. 3—Mathematically equivalent to Fig. 2. 


and Re(-) signifies the real part of (-). It may be verified by the 
reader that Fig. 3 is mathematically equivalent to Fig. 2 as far as x(t) 
and y(t) are concerned. Therefore, we will use it for our analysis 
since it reduces the manipulations required. From Fig. 38, 


c(t) = me [eee + ree (21) 
and 
d(t) = Oo, iBO(t) + 2 pg eee ota ees ct (22) 


Since the second term in equation i is a high-frequency term and 
_ its effect will be filtered out by the filter hg-(t) we will disregard it. 
Using the assumption |¢(t)| K 7/2 we expand e/#* obtaining 


a) = 2 0 + jeg). (23) 


Continuing, we obtain for z(t)? 
eae —. —™~ ‘ 
b(t t)b(t b(t t) b(t 
a(t) = Hate) + jac] = 2D — BERD 4 (20 5 BORO)” ay 
t The reader is referred to Ref. 7 for the proof of equation (24). 
t[-] denotes Hilbert transform of [-]. 
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Therefore, 
—. 
A(t) = 1/406) — Bob) (25) 
and . 
re 6 
y(t) = 1/4O*he OE — B@Mb(M))*hs-). (26) 


Comparing equations (2) and (26) we conclude that they are of the 
same form: 


h(t) = 1/4hg()*ha () 
and 
——™~ 
W(t) = 1/4B@WbO)*he @). 
As a result the bounds given by equations (16) and (17) are applicable 
with 6(t) given by 


0 , a 
a(t) = pay Cyw(t) — 1/48 (o(t) b(t) ‘3 hg (t). (27) 


t=m+ 


However, unlike the nonlinear problem, we may simplify the above 
results considerably as shown below. 


4.3 Simplification of Bound for Systems Displaying Pncidental FM 
If we assume that the basis set {\,;(k)} is complete we obtain from 
equation (27) that 


~~ 
A(t) = — 1/4 BO@bE)*hs- (). 


For the situation where incidental F'M is present we have found experi- 
mentally that quantity v(p) 6(p) [see equation (6)] may be safely assumed 
to be zero. Therefore the bound S, given by equation (17) is applicable. 


For bandlimited signals sampled at the Nyquist rate, 


Fo) lim /n f y(t) sd 


T1770 


0) lim 1/t i * 6°(t) dt 


T2200 





Since the effect of incidental FM on the circuits that we are dealing 
with is a second-order effect and is quite small, it is reasonable to 
assume that 


oS. Oa eee 
[B@ * ha-() 1 >> | B@ML() * he-() |. 
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“ 


Therefore, 


aie [ (0) arm & (tim ee [ " (b(t) * hy-(0))? at) 


Tio 


resulting in 


Fm 1 lm in | WO * ho] rf ,, Sule) de 


“a pe Ch eee ee oy 
7) Pim i/r f[ (OOO) * halk ar © Sy(e) de 
T2700 0 —Ws- Wj 
(28) 
where S,(w) and Sg(w) are proportional to the power spectral density 
of y(t) and 6(t) respectively. 
We will now show that the two integral expressions in (28) are 
approximately equal. In the frequency domain we have 


S,(w) = | B@) |’ | Hz-(w) le 
—~ , 
| 51601») P (29) 


—~ 
| FSO] | | Ha-&) |’. 
The Hilbert transform does not affect the magnitude of the Fourier 
transform. As a result we obtain 
Sow) = | Slob) | * | Ha) |”. (30) 
We will now apply assumption 22 and assume that the phase Jitter 
function ¢(t) is a cosine wave,* 





S 6 (w) 


Il 


o(t) = cos a,t 


where o; is typically either 60 Hz, 80 Hz, or 120 Hz. Equation (30) 
then becomes 


So) = 1/4 | Be@ — w,;) + Bw + ;) |’ | Ha-(@) |?. (31) 

Since the effect of d(t) is only to shift the spectrum B(w) to the 

right and left a very small amount’ relative to its bandwidth, it 
should be clear from equations (29) and oH) that 


i “8 y(w) dw & oes Si (w) dw. (32) 


t §[(t)b(£)] indicates the Fourier transform of $(é)b(t). 


+In practice this is a good assumption. 
§ Recall w, > w;. Typically w, is 3500 Hz and w; S 120 Hz. 
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Using equations (28) and (82) in (17) we obtain 


+ 


S = 20 logio e (33) 


Equation (83) is what we have been striving for; it relates achievable 
suppression to the modulation coefficient of the unwanted incidental 
FM for an echo canceller with a complete basis set. 


V. EMPIRICAL RESULTS 


5.1 Time Variant Systems (Incidental FM) 

Since it is difficult to accurately create a desired amount of inci- 
dental FM on a working L carrier system, an analog simulation 
was used. For all practical purposes it is identical to a real system. 
The only difference is that it allows a controlled amount of incidental 
FM to be inserted. 

We chose flatly weighted noise bandlimited to 4 kHz as our input 
signal. A digital tape was made of the input and output of the L 
terminal simulator and this tape was used as input to a computer 
simulation of the adaptive echo canceller shown in Fig. 1. The results 
are shown in Fig. 4. 

From Fig. 4 we see that there is good agreement between the 
empirical results and the analytical results for peak-to-peak phase 
jitter angles above 4 degrees. Below 4 degrees the echo canceller per- 
formance is limited by the fact that it utilizes a finite number of 
taps. Recall in our derivation of cquation (83) we assumed that the 
echo canceller employed a complete basis set (i.e., infinite number of 
taps). This assumption begins to fail in the vicinity of 4 degrees peak- 
to-peak phase jitter and the performance of the echo canceller be- 
comes limited by the incompleteness rather than by the incidental FM. 


5.2 Nonlinear Systems 

In order to verify our results we simulated the nonlinear echo path 
shown in Fig. 5. We used various combinations of input signals and 
feedback gain constants |K| and in all cases the results were com- 
parable to those shown in Table I for a white noise input. 

A word of explanation is needed to clarify Table I. Suppression was 
calculated according to equation (14). The sampling rate was 0.1 
ms and the averaging interval, Q, consisted of 501 samples, i1.e., a 
50.1-ms interval.*t Forty such averages were computed during the two- 


* 6 must be in radians. 
+See equation (8). 
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SIMULATION WITH 
WHITE NOISE INPUT 


SUPPRESSION IN dB 


INCIDENTAL FM 

FREQUENCY =80HZ 

NUMBER OF TAPS=M=100 
IK] =5.41 x10-3 

SETTLING TIME =1.0 SECOND 





6) 2 4 6 8 10 12 14 16 18 
PEAK-TO-PEAK PHASE JITTER IN DEGREES 


Fig. 4—Time variant simulation. 


second time period after the settling time had elasped. The minimum 
and maximum suppression shown are those of the forty calculated 
suppressions, and the average suppression shown is the ensemble 
average of the forty calculated suppressions. 

For all variation of input signals and feedback gain constants 
tested, we found that S; was 3 dB closer to the actual suppression 
than S,; and within 5 dB of the actual suppression. However, as ex- 
pected, the echo canceller always performed better than either bound. 


VI. CONCLUDING REMARKS 


We have obtained lower bounds on the suppression provided by an 
echo canceller in either a nonlinear or time invariant environment. 
Although theoretically the bounds are only valid when an infinitesimal 
feedback constant is used, we have found empirically that they apply 
for any practical choice of feedback gain constant. If one were able 


x(t) LINEAR Z(t) O(t) LINEAR y(t) 
FILTER FILTER 


3 a, = 0.684 
ort) = >> an fzcty |” ap = 0.0211 
n=] a3 = 0.0361 


Fig. 5—Nonlinear simulation. 
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TaBLE I—Wu1teE Noise Input SIGNAL 


Feedback gain |K| 5.1 * 10-3 
Settling time 1.0 second 
Actual achieved suppression in dB Min. = 18.2, max. = 22.1, ave. = 20.4 
Bound 8, [equation (16)] Min. = 10.6, max. = 15.1, ave. = 13.7 
Bound 8S, [equation (17)| Min. = 13.6, max. = 18.1, ave. = 16.7 


to measure the time variability of echo paths or to measure echo paths 
and then to separate the response into the linear and nonlinear por- 
tions [equation (18)], one could predict the minimum achievable 
suppression. Measurement of nonlinear echo paths has been the subject 
of study of others.®° 

For a nonlinear environment we have found that the nonlinear 
portion of the response and hence the lower bounds are dependent on 
the input signal. As a result, if this method is going to be used to pre- 
dict the performance of an echo canceller for a speech input signal, 
it will be necessary to design an interrogation signal which possesses 
some of the same properties as does speech (syllabic rate, power 
level, peak-to-rms ratio, etc.) 

The derived equations which relate the effect of incidental FM on 
suppression for the single-sideband suppressed carrier systems are 
sufficiently general to provide lower suppression bounds for different 
or more complex systems which exhibit incidental FM. One must 
merely know an effective index of modulation, 8, measured between 
the input and the output. The calculations for more complex systems 
are rather involved and have been omitted. 

Laboratory simulated echo cancellation for a nonlinear echo path 
and for an echo path with incidental FM produced cancellations some- 
what better than the calculated lower bounds. Knowledge of the type 
of nonlinearity or time variability in the echo path environment may 
permit one to empirically increase the lower bounds derived in this 


paper. 
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Some Considerations on the Application 
of the Volterra Representation of Nonlinear 
Networks to Adaptive Echo Cancellers 


By E. J. THOMAS 
(Manuscript received October 19,1970) 


In this paper we apply the Volterra representation of nonlinear systems 
to the echo control problem and propose a generalized adaptive echo canceller 
which compensates for nonlinear echo paths. We prove that the proposed 
echo canceller converges and reduces the echo to zero and finally we suggest 
other applications for the system. 


I. INTRODUCTION 


The Volterra functional representation of a nonlinear system is a 
generalization of the well known convolution integral used for linear 
systems. The validity conditions for the Volterra model are suffici- 
ently weak to be satisfied in many practical applications. 

In this paper we apply the Volterra representation to the echo con- 
trol problem and propose a generalized adaptive echo canceller which 
compensates for nonlinear echo paths. We prove that the proposed 
echo canceller converges and reduces the echo to zero in the absence 
of noise. We also suggest other applications for the adaptive echo 
canceller presented. 


II. THE VOLTERRA FUNCTIONAL REPRESENTATION OF A NONLINEAR 2-PORT 


For linear systems it is well known that the impulse response com- 
pletely determines the input-output relationship. The output signal, 
y(t), is functionally related to the input, x(t), by the convolution 
integral, 


ui) = [wea — 9 ar, () 


where h(t) is the system impulse response. It has been demonstrated 
by other investigators’ that nonlinear systems, whose outputs do 
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not depend on the infinite past, obey a more general functional rela- 
tionship, 


1 Se: i : hega@ =2ehae 
+ [ [ he(r, , T2)a(t — 71)x(t — 72) dr, dre 


+f fas a Tat- dn. @ 
| 


n 


This is an extension of the familiar power series representation of 
a memoryless nonlinear system and provides for the system to have 
memory. It is applicable to all nonlinear systems whose outputs de- 
pend on the remote past to a vanishingly small extent. For example, 
it is not applicable to hysterisis and/or switching systems. The terms 
of (2) were first studied by Volterra* and are called Volterra func- 
tionals. The kernels, An(r1, *** , 7), are generally called Volterra 
kernels. 

For the systems that we are concerned with we will assume the 
following: 


(7) The zero input response, ho(é), is identically zero. 
(iz) The system is causal so that ha(71, t2, +++ , 72) = Ofor any 7; < 0. 
(iit) The system is stable so that for all n, 


/  f [ha(t1 , °°* 5 Tr)) dr, , dt2, °°: , dm < oO, 
0 0 
I 


Because of 7 we may rewrite (2) in the form 


uw) = Xi wld, () 


where 


ni) = [oo [itn ss) = ded. 


iI. AN ADAPTIVE ECHO CANCELLER FOR NONLINEAR SYSTEMS 


We will now generalize the adaptive echo canceller proposed by 
Sondhi®> to cancel the echo of a nonlinear echo path for which the 
Volterra representation is applicable. 
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Since the Volterra kernels are square integrable we may represent 
them by an n dimensional generalized Fourier series,&° 


foe) 


h(n » T2, °°" 5 Fa) — Ds a TES » OPE es 


41421 t2=1 in=l 


‘T;,(7) 1, (72) mS Fie T;,(7»), (5) 
where {I;(t)}* is an orthonormal set complete in the L2 space. The 
coefficients of (5) are given by 


«o wo foo] 
CA e = i | ogre i: h,(r1 9 T2 > 7° "% 4g Tn) 
0 0 0 


‘CeGyl ee) 2 Tea. oer dite (6) 
We will assume that the highest order nonlinearity is of order N. Then 
substituting (5) into (3) we obtain 


N [e) 


y(t) = Ps >» s see > Cog. cauide [ T',,(7,)a(t — 7) dz, 


n=1 ¢41=1 t2=1 tn=1 ) 


i Ce err eee j Tiwese des. 


Before we proceed it will be convenient to adopt a shorthand nota- 
tion. We will define 


OS DO, 8) 


t1=1 t2g=1 tn=l 


Cie = Ce eee a (9) 
and 
w= | Dota aan! (10) 
Thus, (7) becomes 
y(t) = Ds 2d Cy,Wi,Wi, °° * Wi, - (11) 


Now consider the system shown in lig. 1. It contains N subsystems 
designated by the circled numbers. The filters having the set of mutually 
orthonormal impulse responses {T;,(¢)} and the set of outputs {w,,} 


* Some typical sets {T,(¢)} are the impulse responses of tapped delay lines or 
Laguerre networks. 
t Note that w,, is a function of ¢ although it is not explicitly shown. 





NOTE: Cr, ae ory 


x(t) 


2(t)= y(t) - G(t) 
plLgigses ty 


Fig. 1—An adaptive echo 








NONLINEAR 
SYSTEM 


N 
y(t) aa 2 C1 Wi, Wig eee Win 


canceller for nonlinear systems. 






0086 
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due to the input z(t) are common to all the subsystems. Subsystem 1 
produces the output of the filters taken one at a time. Subsystem 2 
produces all possible combinations of the product of the filter outputs 
taken two at a time (i.e., w;,w;,), and subsystem N produces all possible 
combinations of the filter outputs taken N at a time (i.e., w,;,w;,Wi3 °*° 
w,;,). Every output tap of every subsystem has associated with it two 
multipliers and an integrator connected in the configuration shown in 
Fig. 1 for only three taps. Other quantities pertinent to the discussion 
below are also defined in Fig. 1. 

We will now show that the tap gains, G,,(t), of the system converge 
to the generalized Fourier coefficients Cz, of (6) and, consequently, 
that g(t) converges to y(t) so that the residual echo, z(t), vanishes. 

We first define 7 


R;,(t) = C1, es Gz; (t). (12) 


Then, by inspection, the equation governing the gain of each of the 
taps of the pth subsystem is 


N 
dG, (t) z= | K | Wi,Wi, °° * Wi, oD > R,,()w:,:, se We (18) 
I 


dt . ani ‘Te 
From (12) we see that 
aR; ,(t) _ _ dG, (d) | 
dt dt a) 
Also, 
dR (t) _ dk ,(t) 
ao 2h, (t) di (15) 
Applying (14) and (15) to (13) we obtain 
2 N 
ei = 2 | K | R7,(t)w;,wi, “Wi, x ys Rz,()wi,Wi, «°* Win - 
n=1 In 
(16) 


Summing over all the taps within the pth subsystem, and over the 
N subsystems, yields 


d N N 2 

di 2. Dd R;,(1) = —2|K | (x 2, R,()wiwi, ue ws.) : (17) 
p= P n=1 n 

Note that the right-hand side of (17) is always negative or zero. 


* Where the set (titz «++ ip) are the components of J,. 
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Also note that 


N 


W=> 2 R;,(t) = 0. (18) 


p=1 p 
Thus, we have a nonnegative function, whose derivative is always 
nonpositive, and conclude that w(t) must be nonincreasing. It is 
strictly decreasing whenever 


N 
= se R,(t)wi,W:, me Win Z 0. 
n=1 In 
Since, in general 
Wi,Wi, °° Wi, ~ O, 
it is clear that 


lim y(t) = 0. (19) 


t0 
Since every term in y(t) is positive, we further conclude that 


lim #;,(f) = 0 for p=1,2,---,N. (20) 


t70 


Applying (20) to (12) we see that 


limit G7,(f) = Cy, ; p=1,2,--- ,N. (21) 
t-00 
Thus we have shown that the tap gains converge to the generalized 
Fourier coefficients of ha(t,, ***, t™), which is what we set out to 
prove. 


The reader can no doubt appreciate that solving (17) for anything 
but the final value would be difficult and would only be valid for a 
specific x(t). The solution for even a very simple input signal may 
not be tractable. We will not attempt such an analysis here. However, 
from (17) we see that the larger we make |K| the quicker the echo 
vanishes. However, |K| cannot be made arbitrarily large because 
doubling talking periods* will cause divergence problems. 


IV. EFFECTS OF CIRCUIT NOISH AND OTHER CONSIDERATIONS 


The previous analysis assumes a noiseless output signal y(t). When 
circuit noise, n(¢), generated in the echo path is present equation (17) 
would be written as 


* Doubling talking is said to occur when both speakers in a telephone con- 
versation are speaking simultaneously. 
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lau 


N N 2 
DRO = 21K 1(Y DROwiaw., + w.) 
7=1 Ip n=1 Ty 


Q 


N 
— 2] K [nol TD Rew... «++ w.,): 


By choosing |K| small so that the rate of change of the G,, is small, 
it is reasonable to assume that n(t) is statistically independent of all 
other variables in the equation. Then, assuming n(t) has zero mean, 
upon taking the ensemble average of both sides of the above equation 
we obtain 


2s = Rio) 
= -2|K| (x © Rr.(Ow.,00:. + w..) ): a 


The same argument used before then yields 


NV 


lim (> > Rit) = 0. (23) 


£00 j=l 
Of course, before this limit is reached our statistically independent 
assumption would no longer be valid, but at least we may conclude 
that convergence will take place until the noise power becomes com- 
parable to the echo power. 

There are several other difficulties associated with the echo can- 
celler. Recall that we represented the nth-order Volterra kernel by an 
“n” fold infinite summation [equation (5)]. Obviously, if such a sys- 
tem is to be implemented it cannot include an infinite number of 
elements and (5) can only be approximated by 


Ji Jo Js 
h,(n go Tn) awd) Dy Ds beaded ys Gis weve Fa) Pelt) di! i & ee 
(24) 


where the J’s are finite and as small as possible for an adequate descrip- 
tion. 

Furthermore, even if the J’s are small the system shown in Fig. 1 
requires a large number of components. However, one point may not 
be obvious. It can be shown that the Volterra kernels are symmetrical.” 
As a result it may be verified that all the C,,,,...;, are equal for any 
permutation of a specific set of numbers 2, 72, -+-, 2). For example, 


C1o3 = Coa1 = C12 = Cota, ete.. 
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Since the set of gains, G;,...;,, for any combination of a specific set of 
integers 7, ---: 2, are the result of the same physical operations, only 
one tap is required to account for all such terms. For example, consider 
the w,w, tap and the w.w, tap. From Fig. 1, 


Gra = f unwaly() — (0) at, 


Ga = f wanty() — 90) at. 


It is obvious that 
Gar = Gio. 
The contribution of G,. and G2. taps toward ¢(t) is given by 
Gt) lie = Grows + Goywow, = (Gie + GoiWi2 = 2GiWye. 


Thus, the G,. and G,, taps may be replaced by a single tap. Any set 
of taps, G;,...;,, for any permutation of a specific set of numbers, 
1, +++ 2,, can be replaced by a single tap. This considerably reduces 
the total number of taps required. 


V. OTHER APPLICATIONS 


Although we have stressed the echo cancellation application, these 
ideas may also be useful in other areas. Two possible uses are described 
below. 


5.1 Compensator for a Nonlinear System 


Assume that we wish to linearize the nonlinear system shown in 
Fig. 1, such that the resulting output, z(¢), can be expressed by the 
linear convolution integral, 


sas i MG aoa d (25) 


This can be done by first allowing the adaptive system of Fig. 1 to 
converge long enough so that the members of the set {G,,(t)} can be 
considered to equal the corresponding members of the set {C,,} [equa- 
tion (21)]. After convergence, the members of the set {G;,} are forced 
to zero while the members of the sets {G;,} , 7 ¥ 1 are fixed at the values 
determined previously. As a result the compensated output will satisfy 
equation (25). 


ADAPTIVE ECHO CANCELLER 2805 


5.2 Nonlinear System Synthesizer 


Suppose one wishes to study the electrical characteristics of a non- 
linear system which cannot be brought into the laboratory. He could 
do this by making input/output tape recordings of the system, and use 
these as an input to a computer simulation of the adaptive system of 
Tig. 1. A good choice of an input signal would be white noise or any 
other easily generated broadband signal. By allowing the simulation 
to converge and then fixing the tap gains G,, at their final value, the 
nonlinear characteristic can be identified. Then it can be determined 
how the field system will behave for any arbitrary input by applying 
this input to the computer simulation with the tap gains fixed at the 
values determined previously. 


VI. SUMMARY AND CONCLUSIONS 


The Volterra representation is a concise method of characterizing 
nonlinear systems with memory when the output does not depend on 
the infinite past. It is a generalization of the convolution integral 
used in linear analysis, and many familiar concepts may be extended. 
Using the Volterra representation, we have proposed a generalized 
adaptive echo canceller capable of nonlinear compensation. A dis- 
advantage of the proposed echo canceller is its complexity. The pursuit 
of these ideas would be greatly enhanced if an efficient means of meas- 
uring the Volterra kernels of an unknown network could be found. 
The author and other researchers are presently working toward this 
end and several ideas have been proposed. To date, unfortunately, 
they all suffer from the complexity problem. 
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Some Comparisons of Load and Loss Data 


With Current Teletraffic Theory* 


By R. I. WILKINSON 
(Manuscript received February 26, 1971) 


Data are presented and compared with current traffic loss theory in three 
trunking areas: high-usage groups; full (nonalternate routed) groups; 
and final groups. Both single hour and average busy season busy hour load- 
loss comparisons are made. Methods of estimating offered loads from carried 
loads, and from the proportion of calls blocked, are considered. The blocking 
which resulis from offering nonrandom traffic to overflow groups is dis- 
cussed. Modifications tn the theory commonly applied are indicated in each 
case when necessary to obtain satisfactory agreement with observed values. 


I. INTRODUCTION 


The ultimate test of theory is comparison with reality. The procure- 
ment of reliable and reproducible traffic data is relatively difficult 
since little control can be exercised over the character and level of the 
input to the load carrying system without invalidating the genuineness 
of the input. There is then a continuing need to compare the com- 
monly used load-loss relations with observations made under a variety 
of trunking configurations and load characteristics. Data are presented 
here and compared with loss theory in three trunking areas: high- 
usage groups; full (nonalternate routed) groups; and final groups. The 
various findings are summarized at the end of the paper. 


Il, HIGH-USAGE GROUP STUDIES 


2.1 Loads and Losses on High-Usage Groups 


It is commonly assumed that the requirements of a “random” or 
Poisson input are met until originating traffic has passed through a 
restrictive group of paths or switches. It would be difficult and cer- 


* Substantially as presented at the Sixth International Teletraffic Congress, 
Munich, 1970. 
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tainly impractical in most operational situations to check such an 
assumption by examining the call arrival instants or to analyze their 
interarrival times, as well as make a corresponding study of their 
service times. Since in any event exact conformity with the theoretical 
assumptions could not be found, the question would remain as to the 
relative adequacy with which they were met. 

The traffic engineer’s usual wish is to describe the blocking which 
will occur in real-life situations when a given average load is offered 
to a group of paths or switches. It will generally suffice then to com- 
pare observed load versus loss relationships with those theoretically 
derived, rather than attempt an assessment of the agreement of more 
basic requirements. 

The simplest trunking situation occurs when a parcel of traffic 
arising from an indefinite (but potentially large) number of customers 
is offered to a full access high-usage trunk group. A high-usage group 
is presumed here to receive calls quite directly from customers without 
earlier significant constriction and serves them immediately if paths 
are available. If, however, all of the high-usage trunks are busy, ar- 
riving calls are alternate routed to other groups in the hierarchy; and 
negligibly few calls return to the high-usage group for retrial. Thus the 
conditions for the application of Erlang’s loss formula are apparently 
well satisfied. How well do observations made in actual exchanges 
agree with this theory? 

Data of the above character have been obtained on many high- 
usage groups, both interlocal and intertoll. For making comparisons, 
hourly readings of three recording registers are commonly taken: 


(t) Number of calls offered, m 
(it) Number of calls blocked, n 
(iit) Average number of calls seen on trunks during the period by 
switches-in-use counts taken every 100 seconds, giving an estimate 
of the average load carried in erlangs, ¢. 


Observations made by A. Descloux’ indicate that a substantially un- 
biased estimate a of the hourly offered load is obtained from 


.=1/(-4) 0 


Typical results of plotting proportion blocked, n/m, versus load offered 
a are shown in Figs. 1, 2, and 3. Figures 1 and 2 are for interlocal groups 
studied at an Arlington, Massachusetts, No. 5 crossbar office, and at 
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Kildare (Chicago), a No. 1 crossbar office. Figure 3 data are on intertoll 
groups at Memphis, Tennessee, a sectional center in the toll hierarchy. 

We know that over a long period an equilibrium load of a erlangs will 
show a significant hour-to-hour variability in offered loads.” Also, when 
such variable loads are offered to a trunk group, we should expect the 
individual hour losses to tend to fall on the convex side of an arithmetic 
plot of the long-run theoretical load-loss curve, since when aggregated 
their average loss should just meet the theory. To examine this phe- 


1.0 





— — — OBSERVATIONS (NUMBERS ARE GROUP-HOURS OF DATA) 
ERLANG E; THEORY 







0.9 


a8 


GROUP 23 ARLINGTON 
GROUPS 6, 43 KILDARE 


° 
N 


sd 
ro 





3 TRUNKS 
GROUP 12 ARLINGTON 
GROUPS 5,88 KILDARE 


° 
s 


Y 
oe 


PROPORTION OF CALLS OVERFLOWING 
° ° 
w oO 


() 
 @ 3 
1 
7 TRUNKS 
S20 GROUP 18 ARLINGTON 
GROUP 38 KILDARE 
O GROUP 90 KILDARE 


0.2 


2 3 4 5 6 7 8 9 
Q=AVERAGE LOAD OFFERED IN ERLANGS 


Fig. 1—Comparison of Erlang’s loss theory with observations on interlocal 
high-usage trunk groups: busy hour, Arlington, 29 days; and busiest 3 hours, 
Kildare, 20 days, 1958. 
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Fig. 2—Comparison of Erlang’s loss theory with observations on interlocal 
high-usage trunks: Kildare group No. 12, 14 trunks, 11-12 a.m., 18 days, 1958. 
(* single hour. ° aggregated hourly values.) 
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Fig. 3—Comparison of Erlang’s loss theory with observations on intertoll high- 
tees cunt groups, Memphis, 1957-1958: 5 trunks to Milwaukee, 11 trunks to 
Cleveland. 
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nomenon under controlled conditions two simulations were run with a 
designed offer of 1.50 erlangs comprised of 180-second exponential 
holding time calls. The actual average offer observed was d = 1.485 
erlangs to 1 trunk for 400 hours, and 1.409 erlangs to 4 trunks for 100 
hours. The results are displayed on Fig. 4. The individual hour losses, 
while conforming generally to the shape of the theoretical curve, appear 
to be slightly more linearly disposed. As longer summary periods are 
used, the observed values would be expected to approach the Erlang 
loss curve. This is seen clearly in the simulation results. A slight sem- 
blance of this tendency is seen on Figs. 1 thru 3, but it is considerably 
obscured by other fluctuations. 
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Fig. 4—Load vs loss simulation, with equilibrium offered loads, results summarized 
by 1- and 2-hour periods. (Numbers at points show hours of data averaged; @ and # 
are mean values for all hours.) 
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We are directed to the conclusion that although Erlang’s loss theory 
is properly applied only to a system operating in statistical equilibrium 
over a long period of time, for engineering purposes it is suitable for 
describing conversation type traffic load versus loss results summarized 
by periods as short as one hour (summary period = 20 average holding 
times). 


2.2 Lstimation of Hourly Offered Loads from Switch Counts 


It is customary to furnish only a switch counting device on high-usage 
trunk groups, and to depend on the accuracy of Erlang’s load-loss rela- 
tion, 

a’ /x! a 
ltat+a/2!+---+a'/a!’ 


to estimate the corresponding offered load by solving for a in 
£ = afi — #,,,(a)] (2) 


where x is the number of trunks in the group. Desclouxt has shown 
that a certain amount of bias occurs by this method. The reason for 
this is made clear by examining Fig. 5. 

Figure 5a shows the loads offered, as estimated by equation (1), to 
three 1-trunk groups during the 3 busiest hours of the day for 20 busi- 
ness days. The load carried (= occupancy here), as determined by 
switch counting, is plotted as the ordinate. The Erlang relation is 
drawn as the solid line. 

The points comprise a correlation scatter diagram. The classical 
regression line of “‘y on 2,” that is, the average value of carried load 
for a selected average offer, would be expected to agree with the 
Erlang loss relation, since the latter is derived on the basis of offered 
load as the ‘independent variable.” The “y on x” regression line gen- 
erated from the data is indeed seen on Fig. 5b (dashed broken line) to 
conform nicely with the Erlang theory. 

Unless there were perfect correlation, the “x on y” regression line 
could not be expected to agree with Erlang’s theory, and this is 
corroborated by the observed regression (solid broken) line on Fig. 5b. 
At lower than median occupancies (and offered loads) this regression 
line lies slightly below the Erlang theory, and for larger than median 
occupancies, it lies consistently above. Thus when estimating offered 
loads from carried loads by Erlang’s loss formula [equation (2)] we 
should expect the values to be smaller than true for lower than aver- 
age occupancies, and greater than true for larger than average occu- 


i, (a) = 
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Fig. 5—Carried vs offered loads, data on three 1-trunk high-usage groups, 
Kildare, 1958, 3 hours per day, 20 business days. (a) scatter diagram, (b) observed 
regression lines. 
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pancies. For example, for one trunk at an observed load carried 
(occupancy) of 0.75 erlang, Erlang theory would predict an offered 
load of 3.0 erlangs, while the observed regression line would more 
nearly suggest an offer of 2.64 erlangs, a difference of 14 percent. 

An appropriate theoretical expression for the “x on y” regression 
in this circumstance is difficult to generate, depending partially as it 
does on the day-to-day distribution of the offered loads, and this last 
becomes a matter of observation of customer characteristics. Since the 
regression lines may be decidedly nonlinear, the theoretical approaches 
of simple correlation theory are inadequate. In the face of such dif- 
ficulties, one reverts to the reduction of data taken in the field for 
practical estimating results. 

Figure 6 shows the ratio of two estimates of the offered load, a2/ay , 
versus observed occupancy of several 1-trunk groups. Here a, is the 
assumed unbiased estimate from equation (1), and az is from equation 
(2), the latter making use only of the carried load measurement. In 
spite of the rather wide differences in loading among the five groups 
shown, the az/a; ratios are quite similar and one would not have too 
much difficulty in drawing a central ratio line through the field for 
general correction of az values. 

Figure 7 shows a similar chart for three trunks, indicating the ten- 
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_ Fig. 6—Ratio of estimates of single hour loads offered to 1 trunk. Estimate a 
: ue switch and call counts, estimate az from switch count and Erlang’s loss 
ormula. 
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Fig. 7—Ratio of estimates of single hour loads offered to 3 trunks. Estimate a: 
is from switch and call counts, estimate az from switch count and Erlang’s loss 
formula. 


dency for the ratios to approach 1.0 with increasing trunk group size. 
Figure 8 shows a rough summary of the occupancies at which the 
ratios would exceed 1.05 with various-sized trunk groups. Thus one 
might conclude, for example, that estimates by Erlang’s loss formula 
of hourly offered loads on 12-trunk groups would not be acceptable 
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Fig. 8—Median occupancy (load per trunk) beyond which ratio as/a: for 
single hours exceeds 1.05 (approx.). Kildare data for 3 morning hours. 
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without correction (that is they would over-estimate by more than 
5 percent) for occupancies greater than 0.75. 


2.3 Hstimation of Average Overflow Load from Average Offered Load During 
a Busy Season 


When an offered load a varies from hour to hour in excess of the 
amount expected in l-hour segments of a longtime load in statistical 
equilibrium,” the average overflow & over a period of time will exceed 
that estimated by entering the Erlang loss relation with 4, that is, 
a > a-E,,,(d). This result is caused by the always concave upwards 
shape of the offered load, overflow load curves. The amount of such 
excess will depend strongly on the magnitude and character of the 
offered loads; in any event as a gets larger, the effect decreases for a 
given size of trunk group. 

Numerous studies have shown that day-to-day busy hour variations 
in the busy season tend to follow a Type III Pearson distribution, 


6(a) = Ka'e™“, (3) 


in which the constants are determined from the mean & and variance 
v of the data, as 


h=a/—1 

c = a/v 
and K is the normalizing coefficient, c’+1/r(h + 1). Typical is the 
example shown in Fig. 9 for a 9-trunk interlocal group in which @ = 
5.72, Var(a) = 1.61, yielding h = 19.20, c = 3.55. 

Again, there is found a considerable correlation between the day-to- 
day variance and the mean of such a distribution. Figure 10 shows on 
a log-log plot the field of variances versus means of loads offered 
during 3 hours each day for 20 days on 72 high-usage interlocal groups 
at Kildare office. Summaries at other exchanges in the United States 
confirm a similar association of variances and means. The general 
line of regression of variance on mean for the corresponding scatter 
diagrams is shown by the solid line, whose equation is approximately 


Var (a) = 0.314. (4) 


(A dashed line has also been drawn to approximate the major axis of 
the elliptical pattern of points. Its equation 1s 


Var (a) = 0.134". (5) 


It will be referred to in a later section.) 
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If the Erlang loss formula can be used to estimate the proportion of 
calls which overflow a high-usage group during a single hour, the average 
load overflowing, @, over a series of hours is then calculable from 


a = i ; Hl, Cayo) de: (6) 


Similarly, the day-to-day variance of the overflow loads is determined 
from 


@ 


Vana: / [a x(a) }°0(a) da — a. (7) 


a 


Curves have been calculated by numerical integration, using the regres- 
sion relation of Fig. 10, which give the ratio of & to a, , the latter being 
the overflow corresponding to an offer of d, found according to 


a, = GH,,,(da). (8) 
Values of @/a, are given on Fig. 11. At constant loads on the last 


trunk (shown by dashed lines for 100 call seconds per hour, or CCS) 
the ratios &/a, are relatively constant; hence a simple table, Table I, 


__— OBSERVED 19 HOURS 
0.90 a= 5.72, VAR (a) =1.61 


_— TYPE I FITTING CURVE 
- 
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Fig. 9—Variations in day-to-day busy hour loads, Kildare group No. 67, 9 
trunks, 20 days. 
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Fig. 10—Day-to-day variance vs average load in clock hours, 72 interlocal high- 
usage groups, 3 hours, Kildare, 1958. 


using the last trunk load as the index is adequate for many working 
purposes. An example of the need for correcting by the factors of Table 
I is shown in Fig. 12. In the left diagram, uncorrected a; values are 
plotted against average overflow loads, &@, , calculated by the believed 
generally unbiased procedure 


~i yy, mlm _ 
pat Seare 9) 
where s is the number of hours summarized for each group. [Compare 
with equation (1).] 
After correction the overflow estimates are shown in the right diagram 
to be in much better agreement with the @, values. 


2.4 Hstimation of Average Offered Loads from Average Loads Carried 


To obviate the labor of calculating individual hourly estimates of 
the offered loads from observed hourly carried loads on high-usage 
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Fig. 11—Correction required in a, to estimate &. 


groups (which would in turn require a correction as discussed in Section 
2.2), the carried loads ¢ are commonly averaged first, and this average 
£ is then entered in Erlang loss theory curves or tables to obtain az from 


= afl — £,,,(a7)]. (10) 

One needs then to compare az with the true offer a. This is done as 

follows. Choosing a value of @ as the offer to x trunks, & is obtained 

by correcting a; employing the appropriate factor from Fig. 11. One 
then obtains ¢ from 

l=a-% (11) 


TABLE I—CoRRECTIONS IN a, TO ESTIMATE & 


Range of Corrections Corrections to ag 
Load on the Last Trunk Seen in Fig. 11 for Practical Use 
CCS erlangs 

8 0.22 1 .32-1.37 1.33 
10 0.28 1.22-1 .28 1.25 
12 0.33 1.16-1.21 1.20 
14 0.39 1.11-1.16 1.15 
18 0.50 1 .05-1 .09 1.08 
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Fig. 12—Effect of corrections by Fig. 11 on estimates of overflows from high- 
usage groups of Kildare Tandem No. 1, 9-10 a.m. 


since that load which does not overflow must naturally be carried. 
Relation (10) then gives az. A field of values of @/az have been calculated 
and form the curves of Fig. 18. We see that the ratio is sensitive both 
to proportion of overflow and to the number of trunks. 

Corresponding data for Kildare groups are shown on Tig. 14. Similar 
forms of probability density distributions are observed. 

It will be noted that there is a generally maximum ratio on lig. 13 
which tends to occur roughly at /,,,(a) = 0.2, that is, at an expected 
overflow of 20 percent. It is interesting that this maximum correction 
required on a; occurs squarely in the middle of the most common 
economic high-usage group operating levels. 

For practical use we have constructed the traces of the 3-dimensional 
surface of Tig. 13 which correspond to several values of ‘economic 
CCS on the last trunk.” (lor a discussion of economic CCS, see Ref. 3, 
Section 7.6.) The theoretical @/az ratios for 8, 14, 20, and 25 CCS on 
the last trunk are shown on Fig. 15. Comparison with data taken on 
a number of intertoll high-usage groups at Memphis, Tennessee, is 
shown in Fig. 16. Although the dispersion among individual groups is 
considerable, the grouped average values show reasonable agreement 
with theory. Table II shows the values of corrections to az required 
to best approximate the true offer @ for 20 days of data, for the four 
selected last-trunk CCS. A brief study indicates that when the busy 
season includes fewer than 20 days, the required correction is somewhat 
smaller. 
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Fig. 14—Observed corrections required in az to obtain improved. estimates of 
aitered load a, 100 high-usage groups in 3 tandems, Kildare, 1958, 3 hours for 20 days. 
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Fig. 15—Practical correction of az to estimate d, with dependence on engineering 
level of load on “last trunk.”’ 
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Fig. 16—Comparison of theory and data in corrections of az to estimate d, intertoll 
data at Memphis, 1957-1958. 
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TasLE IJ—AVERAGE CORRECTIONS TO BE APPLIED TO a; TO OBTAIN 
IMPROVED ESTIMATES OF G@ 
(€ determined as 20-day average) 


No. Trunks Correction Factors for y CCS on the Last Trunk 
y= y = 14 y = 20 y = 25 
1 1.16 1.14 1.11 1.07 
2 1.10 1.09 1.08 1.06 
3 1.07 1.07 1.07 1.05 
4 1.05 1.06 1.06 1.05 
5 1.04 1.05 1.05 1.04 
6 1.04 1.04 1.05 1.04 
7 1.03 1.04 1.04 1.04 
8 1.03 1.04 1.04 1.04 
10 1.03 1.03 1.03 1.03 
12 1.02 1.038 1.03 1.03 
15 1.02 1.03 1.03 1.03 
20 1.02 1.02 1.02 1.02 
25 and up 1.01 1.02 1.02 1.02 





2.5 Estimation of Average Offered Loads from Average Overflow Ratios 

To monitor the level of traffic flow in some systems, a pair of 
registers which count numbers of calls offered m, and numbers of calls 
blocked n, are provided instead of the more common ‘carried load 
meters. The value n/m is called the overflow ratio. Descloux? has 
shown that, under the condition that the observation period is long, 
say 20 holding times, little bias exists in estimates made of individual 
hourly offered loads to high-usage groups when a is determined from 
the Erlang loss relation 


E,.,(a) = n/m. (12) 


To increase the reliability of such estimates, the overflow ratios for 
a number of hours are commonly averaged and entered as for a single 
hour in equation (12). We inquire whether the estimate of average 
offer, a7, , So obtained is unbiased, or requires correction. The expected 
loss probability when a load a, varying from hour to hour according 
to 6(a), is submitted to x trunks is 


Bo = [ Ey .(a)6(a) da. (13) 


When a constant load equal to the average load @ is offered to x trunks, 
a loss probability H’ will result, | 
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i’ = E,,,(d). (14) 
In general FH will be different from F’. 


Suppose that actual busy hour overflow ratios are observed over 
a period of days, and averaged giving n/m. If @ is estimated as 4, 
through equating n/m to E in equation (13) we should expect a, to 
be unbiased since both n/m and / contemplate the presence of day- 
to-day variations, the first in reality, and the second by the inclusion 
of @(a) in equation (18). However estimating a@ through substituting 
n/m for E’ in equation (14) will yield a7, , containing a certain amount 
of error since it merely determines the single hour load corresponding 
to the average loss. 

A range of offered loads to trunk groups of size 1-24, covering the 
cases of 8 to 18 “CCS on the last trunk,” has been studied and 4a,/az, 
plotted against the numbers of trunks. This is shown in Fig. 17. For 
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Fig. 17—Correction required in offered load estimated from averaged n/m 
readings. 
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one and two trunks the estimation procedure based on equation (14) 
considerably underestimates the average offered load; but beyond 
four trunks, overestimates of the true average offer are generally made. 
The cause of the inversion in the correction required lies in the shapes 
of the Erlang load versus loss curves in the ranges of interest. The 
one- and two-trunk curves are predominantly concave downwards 
(see for example Tig. 1), while for the larger trunk numbers the curves 
are concave upwards. Only for the one-trunk case are the corrections 
sufficient in most practical situations to warrant their making. Here 
a median correction is about 1.18. 

Available data for groups having call and overflow counters, while 
showing considerable dispersion, confirm the shape and location of 
the correction curves of Fig. 17. 


III. GROUPS WITHOUT ALTERNATE ROUTES 


There has long been interest in the different theoretical formulas 
used by American and European administrations for engineering inter- 
local trunk groups. The former have relied on the cumulative Poisson 
while the latter have favored Erlang’s loss formula. What do data 
taken on such groups indicate? 

In 1959 data were taken on some 80 direct trunk groups terminating 
in the Arlington, Massachusetts, No. 5 crossbar office. Hourly observa- 
tions comprised numbers of calls offered to each group, the number 
blocked, and, by switch counting, the average load carried. Illustrative 
of these is group No. 26 with 32 trunks observed for 4 hours a day for 
29 business days. Offered loads were estimated for each hour by equa- 
tion (1). The load versus blocking relationship observed for 116 hours 
is shown in Fig. 18. Superimposed is the Erlang loss relation. The 
agreement is seen to be very good. 

On Fig. 19 are shown only the twenty-nine 10-11 a.m. busy hour 
loads of Fig. 18. The agreement with Erlang’s theory is again under- 
standably excellent. Other group data comparisons were nearly as 
satisfactory. We are led to conclude that Erlang’s loss formula de- 
scribes quite well the hourly blocking for conversation traffic on direct 
groups which do not have alternate routes. Apparently the return of 
blocked calls here was of a nature that caused little disturbance of 
the Poisson character of the offer—or perhaps they contributed to it! 

In the lower section of Fig. 19 is shown a distribution of the busy 
hour loads offered to group No. 26 as they varied from day to day 
over the 6 weeks busy season. For the average offer of @ = 21.37 
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Fig. 18—Comparison of load vs loss data with Erlang’s theory: full group No. 
26 of 32 trunks, Arlington, 4 busiest hours of each day for 6 weeks, 1959. 


erlangs, a variance Var(a) = 20.3 was observed. Since the probability 
of loss curve in this range is concave upwards, the average loss for 
all the hours tends to exceed the loss at the average load. Thus the 
theoretical single hour loss at 21.37 erlangs offered is 0.0074. However 
the observed average loss was found to be 0.0197 when hourly losses 
were given equal weight (as in American practice). (If the losses had 
been weighted by the corresponding offered loads, the average loss 
would be still greater at 0.0267.) 

Clearly, if Erlang’s formula describes well the losses for individual 
hours, it will not usually give an adequate estimate of the average loss 
over a series of busy hours. However, when the more conservative 
Poisson summation values, 


i] a 


P(e, a) = ys ’ 


r=2 r! 





are laid on Fig. 19, they are seen to pass directly through the (un- 
weighted) average loss point, indicating that at least for the amount 
of day-to-day variation present in this sample of hours, the Poisson 
provides just the right amount of “improvement” to Erlang’s loss 
formula. 
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Fig. 19—Comparison of load vs loss data with Erlang’s theory: full group No. 
26 of 32 trunks, Arlington, 10-11 A.M. busy hour each day for 6 weeks, 1959. 
(Numbers at points are hours averaged.) 


There is then a particular variance of day-to-day busy hour loads 
which when applied to Erlang’s loss formula will just produce the 
average loss of the Poisson formula. Figure 20 shows a field of curves 
indicating the Var (a) required for average loads, a, such that at the 
loss levels given, the Poisson summation will closely relate the average 
busy season busy hour loss to the average offer. On the same field are 
shown the actual variances versus averages observed for the trunk 
groups in the Arlington study mentioned above. It is seen that the 
variances found in practice are such that at commonly used interlocal 
average busy season busy hour grades of service, 0.005 to 0.03, the 
Poisson summation provides an excellent means of specifying group 
average capacity. It may be noted that in American practice, the 
objective grades of service are set to be met by the average (unweighted) 
blocking in the busy season busy hours. 

The dashed line on Fig. 20 is a centrally located curve which describes 
the general relation seen between day-to-day variance and average 
load offered. It is interesting — and reasonable - that its equation is 
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Fig. 20—Variations in time consistent hourly loads on 32 direct groups of 
trunks: Arlington, 4 busiest hours of each day for 6 weeks, 1959, compared with 
“Poisson assumption.” 


Var (a) = 0.13 @ where r = 1.58, identical with that of the central 
axis line drawn through the high-usage group observations on Tig. 10. 
The dotted line on lig. 20 results from choosing r = 1.50; it corresponds 
nicely with Poisson blocking of 0.01 for loads of 10 to 100 erlangs.* 


*It may be noted that Bell System traffic capacity tables for nongraded groups 
carrying random and nonrandom offers have been generated at several blocking 
levels, and contemplate the following hour-to-hour load variations: Single hour 
blocking; “low” hour-to-hour variation (r = 1.50); “medium” hour-to-hour 
variation (r = 1.70); “high” hour-to-hour variation (r = 1.84). At loads of one 
erlang, these several traffic tables contain the same allowance for day-to-day 
variations. Below one erlang the theoretical day-to-day variations, while inverted 
from their relative positions at loads above one erlang, are small compared with 
the momentary variations inherent in random traffic. 
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IV. FINAL GROUPS 


Much has been written in the past on estimation of the effect of 
the nonrandomness (peakedness) of overflow traffic from high-usage 
groups upon the trunking requirements in final groups.’ Studies have 
also been made (as in the previous section) showing the added trunks 
required to accommodate the increased demand on groups whose 
offered loads show significant day-to-day variations. 

The two effects will usually appear simultaneously in the engineer- 
ing of final groups. Moreover, loads offered to final groups may be 
expected to show generally larger day-to-day busy hour variations 
than do loads to high usage and direct groups not having alternate 
routes. Figure 21 shows a field of variance versus average loads 
offered to 28 interlocal final groups in which negligible first-routed 
traffic was present. A central fitting line, having the equation 


logio Var (a) = 1.8404 logi, d — 0.8861, (15) 


has been drawn through the points, or equivalently, 


Var (a) = 0.134". (16) 


When this variation is assumed, the carrying capacities of 6 to 15 


1000 


FITTING LINE IS 
LOG VAR (a) = 1.8404 LOGip a-0.8861~~ 
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Fig. 21—Variations in busy hour loads offered to final groups in 28 interlocal 
alternate route systems. 
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trunks at an 0.02 average blocking are shown illustratively on Fig. 22. 
For 10 trunks, for example, the erlang capacity for random traffic is 
reduced from 5.08 to 4.47 erlangs, a drop of 12 percent resulting from 
the day-to-day variations. 

Similarly, if day-to-day variations are not introduced, but instead 
a nonrandomness, characterized by a peakedness factor P.F. = 
(variance) /(mean) = 1.5, is assumed, the capacity of 10 trunks is 
reduced from 5.08 to 3.96 erlangs, a reduction of 22 percent. 

When day-to-day variations and nonrandomness are jointly intro- 
duced, the capacity of a group is further reduced; thus the 10 trunks 
will now accommodate an offer of only 3.57 erlangs, a reduction of 
30 percent from the basic Erlang loss formula value. It is clear that 
each cause can produce substantial reductions in trunking capacity. 

When the offer to a final group contains a significant fraction of 
first-routed traffic, both the peakedness factor and the day-to-day 
variations will commonly be lowered; the trunking requirements will 
then be correspondingly reduced on both accounts. 

An illustration of the importance of using theory which considers 
both day-to-day variations and the peakedness of the traffic is given 
in Fig. 23. The results are shown of observations on a Kildare final 
group of 59 trunks, with 62 subtending high-usage groups plus 4 first- 
routed traffic items. From high-usage group load and trunk configura- 
tions the estimated peakedness factor of the final group offered load 
is 1.93. The load-loss characteristics observed for the 20 days are: 





Hour Average Day-to-Day Average 
of Offer Variance of Blocking 
Day (erlangs) -the Offer Observed 
9-10 a.m. 26.5 28.5 0.001197 

10-11 a.m. 44.0 180.3 0.04570 


As seen on the figure, the simple Erlang loss values (dotted lines) are 
an order of magnitude below the observed losses. Although allowance 
for nonrandomness (dashed lines) makes a marked improvement, it is 
still far from describing the actual losses. Nor is the Poisson model 
(dots on Fig. 23), which might be expected to compensate partially 
for day-to-day variations in the 0.01 to 0.03 loss range, a nearly 
sufficient improvement. But employing the typical day-to-day vari- 
ance magnitudes of Fig. 21 in conjunction with the estimated peak- 
edness of the offer, the solid line load-loss curves for 58 and 60 trunks 
exactly bracket the average loss values seen for 20 days for the 9-10 
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Fig. 22—Final trunk group engineering with various assumptions as to the 
character of offered loads. Average blocking = 0.02. 


A.M. and the 10-11 a.m. hours on the 59-trunk final group. Construc- 
tion of such load-loss relations is described in Ref. 4. 


Vv. SUMMARY 


Examples have been given comparing observation, simulation, and 
theory in various areas of traffic flow on trunks comprising direct and 
alternate routed plans. Particular attention has been drawn to the 
need for developing adequate relationships between offered, carried, 
and overflow loads, both single hour and average busy season, suitable 
to each operating condition. Where possible, a physical understanding 
of the principal factors is followed by statistical theory which may re- 
quire approximate numerical calculations. Further insight may be 
gained from controlled simulations. In order for the relationships de- 
veloped to be useful, they must finally be found to agree with the flow 
of traffic in real situations. 

The following particular results have been determined: 


(1) Single hour load-loss relationships on high-usage groups are 
found to be well described by Erlang’s E,,, (a) loss formula. Com- 
parisons are made with data and simulations (Figs. 1, 2, 3, 4). 

(72) Estimating single hour loads offered to high-usage groups from 
observed carried loads by use of Erlang’s loss relationship yields too- 
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large values at the heavier occupancy levels, and slightly too-small 
values at low occupancies. This is explained and illustrated by re- 
course to regression theory (Fig. 5). The magnitude of the corrections 
needed are indicated in Figs. 6, 7, 8. 

(izi) If busy season busy hour carried loads are first averaged and 
this value entered in Erlang’s E,-relation the average offer will be 
underestimated. Corrections required for a 20-day busy season are 
given in Table II and Figs. 13, 14, 15, 16. 

(iv) Estimates of the average offer obtained from averaged hourly 
blocking proportions show that both positive and negative corrections 
may be required (Fig. 17). Except for the 1-trunk case, however, the 
corrections indicated are small. 
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Fig. 23—Comparison of blocking theories with performance of Kildare final 
group No. 1, 62 high-usage groups plus 4 first-routed items, 59 trunks, P.F. = 
1.93, 20 days, October-November 1958. 
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(v) When a series of busy hour loads is submitted to a high-usage 
group, the true average overflow load will exceed that estimated by 
entering Erlang’s relation with the average offer. Corrections are given 
in Table I and Fig. 11. 

(vz) For full groups (i.e., those without alternate routes) it is found 
that Erlang’s formula describes well the load-loss relation for single 
hours (Figs. 18, 19). 

(vit) When busy hour loads offered to full groups are averaged for 
the busy season, the corresponding average loss will usually be better 
estimated by the Poisson summation formula, if the loss values are 
in the common range from 0.005 to 0.03 blocking. 

(v2) Loads offered to final groups will normally be nonrandom 
(peaked) and hence require special procedures for engineering. Ex- 
amples are given in which both peakedness and day-to-day busy hour 
variations are present. Unless both influences are allowed for, the 
average load-average loss estimate may be far from that actually 
observed (Fig. 23). 
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When Are Transistors Passive?* 


By B. GOPINATHt? and D. MITRAt 
(Manuscript received December 8, 1970) 


The paper presents results on the stability and dynamic behavior 
under large signal conditions. of networks consisting of transistors and 
sources connected to a linear, passive, memoryless subnetwork. The tran- 
ststors’ model incorporates various nonlinearities. A characteristic common 
to the main results of the paper is that they relate to properties of the tran- 
sistors alone and, hence, are independent of the passive part of the network. 

Sufficient conditions are obtained for asymptotic and bounded input- 
bounded output stability. The conditions impose restrictions on some of 
the physical constants of the transistors’ model. These conditions have 
an interesting physical interpretation in terms of temperature differentials 
in the transistor junctions. In particular, any transistor with the expo- 
nential type of static diode characteristic ts passive only tf the ratio of the 
junction temperatures lies inside an interval determined by the a’s. 

In the state space of the network there exists a well-defined region R 
specified by the transistors’ model with the property that constant terminal 
states in R are independent of initial conditions. The region R is in a 
certain sense maximal. 


I. INTRODUCTION AND DERIVATION OF THE DIFFERENTIAL EQUATION 


The network considered is shown in Fig. 1 and it consists of a number 
of transistors connected to a subnetwork composed of voltage sources, 
current sources, and linear, passive, memoryless devices such as resistors 
and transformers. Sandberg’ has analyzed the dynamic behavior of such 
networks and has defined a class which exhibit various features of 
stability. The results of this paper are essentially different in that they 
relate to properties of the transistors alone and, hence, are independent of 
the passive part of the network. 

The analysis is concerned with certain network-theoretic properties of 


* This paper was presented at the 1970 IEKE International Symposium on Circuit 
Theory, Atlanta, December 1970. 
t The sequence of names was chosen by flipping a coin. 
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TRANSISTOR 1 TRANSISTOR 2 





Fig. 1—General network containing transistors, sources, resistors, and transformers. 


transistor models that are assumed in extensively used Network-Analysis 
Programs, such as NET 1, CIRPAK, CIRCUS, ete., (see Ref. 2). The 
intent of the paper is to study the large signal behavior of transistor 
networks using this model and certain generalizations of it. Each transis- 
tor is represented by a model of the type shown in Fig. 2 which takes into 
account the nonlinear de properties as well as the presence of nonlinear 
junction capacitances. Six parameters are associated with the model: 
Q1, M2, T1, T2; €1, aNd Cz, all of which are positive; also a,, a2 < 1. The two 
nonlinear static diode functions are denoted by f,(-) and f.(-). Initially 
it is assumed that f,(-) and f.(-) are monotone, strictly increasing 
mappings of the interval (— ©, ©) into itself; f,(0) = f.(0) = 0, and 
fi(-) and f.(-) are continuously differentiable on (— ©, ©). Further 
assumptions are made about f:(-) and f.(-) in the course of the paper. 

Suppose the network has n transistors and with the polarity indicated 
in Fig. 2 let vo;-1; and v2; respectively denote the emitter-to-base voltage 
and collector-to-base voltage of the zth transistor. Similarly, let 7;_, 





Fig. 2—Transistor model. _ 
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and 72; respectively denote the emitter current and the collector current 
of the 7th transistor. An identical scheme of subscripting is used to 
define f;(-), c:;, and 7; for? = 1, 2,--- , 2n. v, 2, and F(-) are 2n-column 
vectors formed by arranging {v;}, {7;}, and {f,;(-)} respectively. In 
applying the current law to the transistors’ model it follows that 


i= £0) + TF) (1) 


where 
[Co)]; = 0; + rifi@:) 
and 7 = 7,@T7.@--- @T, the direct sum of n 2 by 2 matrices 7; in 


which 
Tl = 1 Bs 
—ai 1 


The subnetwork to which the transistors are connected is assumed to 
impose a constraint of the form 


i=-Gt+b (2) 


forj7 = 1,---,”. 


in which G is the conductance matrix and hence G = 0,* and 6 is an 
element of all real bounded continuous 2n-vector-valued functions of ¢ 
on [0, ~). 

From (1) and (2), 


< CO) TRO) Ay SB, (3) 


The above equation is also derived in Ref. 1. Since all of the c; and 7; 
are positive and each of the f;(-) is continuous and monotone increasing, 
C;'(-) exists; obviously C™’(-) is also strictly monotone increasing 
and C-*(0) = 0. 


Il. ASYMPTOTIC STABILITY FOR THE UNFORCED SYSTEM 


In this section we show how to prove an intuitively reasonable result 
concerning the asymptotic stability of the system. A Lyapunov function 
with a simple energy interpretation is introduced and used to prove 


* G@ 2 0 indicates that G is a positive semidefinite matrix. Unless otherwise stated, 
G will only be assumed to be positive semidefinite. 
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stability. Let 


2n Ci(vi) 
LO* Lf creas. (4) 

i=1 0 
Since C;(-) defines the charge-voltage relationship in the nonlinear 
junction capacitances, L(v) is the total stored electrical energy in the 
network. It is easily verified that (2) L(v) = 0, equality holding only 
when v = 0, and (zi) L(v) > © as ||v|/ > ©. Thus L(v) is a Lyapunov 
function if (d/dt)L(v) < 0. For the unforced network 


£ Lt) = —v'TF) — v'Gv. (5) 


Clearly if o' 7F(v) = 0 with equality holding only if v = 0 then L(v) is a 
Lyapunov function since G has been assumed to be positive semidefinite. 
It then follows from a well-known result in stability theory’ that 
v(t) ~Oast— ~, 

Note that for constant v the term v'TF(v) expresses the net power 
flow into the transistors and if all the transistors are passive then 
certainly v'7'F'(v) > 0. However it is shown later that in certain abnormal 
conditions the transistors are not all passive and the judicious choice of 
the passive network allows one to extract energy from the transistors. 
To show this we first prove the following: 


Lemma 1: Given two real 2n-vectors x and y such that x'y > 0, there exists a 
G > 0 for which the following holds 


y = Gz. (6) 


Proof. Define a 2n by 2n — 1 matrix Z such that the columns of Z span 
the (2n — 1)-dimensional subspace orthogonal to z. Then Z'z = 0. 
Since by assumption y is not orthogonal to z, y is not an element of the 
range space of Z and, hence, the columns of Z together with y span E””. 
For these reasons the following construction of G, 


1 t t 
Cea, + ZZ, (7) 


suffices to prove the lemma. 
When for some x1, , v;TF(v,) < 0 then by Lemma 1 there exists a 
G > 0 such that 


TF(v,) + Gr, = 0. (8) 


Hence with initial condition v(0) = »v, the solution of the network 
equation (3) with b = 0 is, 
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v(t) =» (9) 


and the system is not globally asymptotically stable. 

The above results may be summarized in the following: 
Theorem 1: If v'TF(v) > 0 Vv # 0 then the network is globally asymp- 
totically stable. Furthermore, tf there exists a v, such that viTF(v,) < 0 
then there exists aG > 0 for which the system 1s not globally asymptotically 
stable. 


2.1 Sufficient Conditions for Asymptotic Stability 


Consideration is given to the positivity of v'7'F(v). Due to the quasi- 
diagonal structure of 7 we need only consider in detail the behavior of 


ce . (10) 
fos(os) 
For notational convenience ai, a3, V2;-1, V2) fei-i(-), and fe;(-) are 
respectively denoted by ay, a, 01, V2, fi(-), and fo(-). 

Note that when v,v. S 0, g(v) 2 0 and g(v) = 0 only if », = v». = 0. 


Let v; ¥ 0, v2 = pv, and p = 0. Then 
gv) = uf1(V1) — avife(v2) + Vefe(v2) — oadef (v1) (11) 


1 —a} 
pVo5-1 y Veo8) a (25-1 a] 


t 
—> 





and 

g(o) & OO = 1 — pas) + BAD (p — a), (12) 
If p = a, then g(p) > O since 1 — aia, > 0. If p ¥ a1, g(p) = Oif and 
only if 


es pOz _ f2(pv:) 
a1 — p fils) ug 


(1 — paz)/(a, — p) is pee as a function of p in Fig. 3. The salient 
features of the function for p 2 0 are: 
i — pao J 


=— for OSp<aQq 
a, — p Oy 


A 


<0 _ for erp: (14) 
2 


<a, for A gee es 
a2 


Now [f2(pv,)/f:(v1)] is a monotone, strictly increasing function of p and 
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1-pas 





a1-p 





Fig. 83—Plot of (1 — paz)/(a — p). 


[fe(ov1)/fi(v1)]|[,-0 = 0. Clearly if 


fo(ais) < 1 





and h@) ~ a Vu, #0 (15) 
ii 
on) 
fi@) =~? 


then no solution of (13) exists. Since g(p) is continuous in p, g(0) > 0, 
and g(p) 0 V p > 0, it follows that g(p) > 0 V p = O. Hence sufficient 
conditions for the positivity of v'TF(v) are: 


foi (oeiv) < 1 
fos-i() mae 
foi(v) San! 
fos—1(agv) aes 


Remark: 


and Vo OF cand) 4-12, oes 5 (16) 


fos(ov) ~ 1 — pa 
fos—1@) Qi — p 
1 — pa 
Q; — p Q2 


Vv+#0 (17) 
> 


is both necessary and sufficient for v'TF(v) > 0. However, the practical 
value of such a condition is limited. 
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2.2 Necessary and Sufficient Conditions for the Exponential Type of 
Nonlinearities 


The functions f;(-) in the conventional Ebers-Moll model and in the 
charge-control model’ are of the following form: 


f:(v) = a; (exp rA,v — 1). (18) 


Such exponential nonlinearities are subsumed in a class of nonlinearities 
with the following properties: 


(t) f:(v;) = afQ.v,), 2 = 1, 2, --- , 2n, where a; and \; are positive 
constants. 
(zi) f(-) is a monotone, strictly increasing function. 
(zz) f(0) = 0. 
(iv) lim,._. fv) = —1. 
(v) lim,.. [f(pv)/f(v)] = 0 for all p where 0 < p < 1. 


When the transistors’ static diode characteristics fall under this class 
then the following is true: 
necessary and sufficient conditions for v'TF(v) = 0 (v'TF(v) = 0 
only when v = OQ) are 
we Goi-1  Aoi-1 < nf (19) 


a 


) — 
Q2i Noi Q2 


fort = 1,2,--+,n. 

Sufficiency follows from the definitions and (16). For the necessity 
part consider the case when v2;-. ~ — © with the remaining 2n — 1 
variables fixed. It is necessary that the coefficient of v2;_, in the expansion 
of v TF (v) approach a nonpositive value, i.e., 


lim (f2i-@) — aife(r)} < 0 (20) 
fora = 1,2, --- ,n. 
Similarly, 

jim {fei@) — afo:rQ~)} <0 (21) 
for? = 1, 2, --- , n. Hence 


—@2;-1 + aja; S 0 


and, 


— 2; + a242;-1 S 0 
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or 
poi: gee 1 

os Ss (22) 
Qe; Qe 

fort = 1, 2,--- ,n. To obtain the remaining conditions consider equation 


(12) in the notation of this section. By property (v), 


: fos(pv2s-1) 
] —_o os = O0 
oe fos—1(Voe-1) 


Noi-1 





for all p such that p < and @=1,2,--: ,n. 


2t 


Hence, a necessary condition for the positivity of v'TF(v) is: 1 — pai > 0 
is implied by p < Ax;-1/Aai 5 L., 


Noi-1 1 
cA LEE peee 
Nei = Qs (23) 
for 7 = 1, 2, --- , n. Repeating the argument with the roles of v,;-, and 
Vo; Interchanged yields 
ge (24) 
Noi 
for 7 = 1, 2, --- , n. This concludes the proof. 


Remark 1: It is clear from the derivation that the conditions stated 
in equation (19) with the ratios replaced by the appropriate limits 
are necessary for the positivity of v‘'TF(v) when {f;(-)} are general 
monotone, strictly increasing functions. 

Remark 2: If condition (19) is violated then there exists av, , || v1 || << ©, 
such that vj TF (v,) < 0. The proof is as indicated below. 

Consider equation (12) and say ),/A2 > 1/a2; then for p such that 
1/az < p < i,/dz it follows from property (v) that there exists a V such 
that for v; > V, g(p) < 0. Likewise, for a;/az > 1/az and p(p > 0) such 
that (1 — a,(a2/a,)) + p(a2/a, — a2) < Git follows from property (zv) 
that there exists a V such that for v, < V, g(p) < 0. For the remaining 
possibilities the proof follows by interchanging the roles of v, and v2 in 
the definition of g(p) and proceeding as above. 


IlI. PASSIVITY 


This section is devoted to a discussion on condition (19) which has 
been shown to be both necessary and sufficient for the passivity of 
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transistors for which the static diode characteristics are of the form 
f:(v;) = a,(exp xv; — 1) (25) 


in which a, and \; are positive constants. When the temperature in the 
transistor is uniform, then from the well-known physical model,’ \; = 
(q/KT) where q, K, and T are respectively the electron charge, Boltz- 
man’s constant, and temperature. However, when a temperature dif- 
ferential exists in the transistor junctions, for instance in a p-n-p 
transistor the temperature in the neighborhoods of the p-n and n-p 
junctions are respectively 7, and T,, then the same physical model 
holds® and 


210): aan 
Nea = Rp and dn = Ee 
so that 
Aoi _ T's 
ag wo) 


Hence when the ratio of the temperatures at the base-collector and 
emitter-base junctions of at least one transistor, say the 7th, lies outside 
the interval [aj , 1/a3] then there exists a 2n-vector v, such that vf TF(v,) 
< 0. Furthermore, by Lemma 1, there exists a positive definite matrix 
G such that the solution of the network equation (3) with b = 0 and 
initial condition v, is v(t) = v, . Then, 

power delivered to | mip sop) SAN: 

passive subnetwork 


The temperature dependence of the a,’s may similarly be exploited to 
deliver power. The phenomenon of differential heating of transistors as 
described above is the basis of the thermocouple effect and thermo- 
electric generators.” 


IV. BOUNDED INPUT—-BOUNDED OUTPUT STABILITY 


It is shown in this section that for a positive definite matrix G, bounded 
inputs, and passive transistors there exists a bounded neighborhood of 
the origin which, loosely speaking, is sure to contain the forced response 
of the network. The norm used is the Euclidean norm. 

Now 


vTFv) >0 Vv #0 (27) 
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and let 
[| b@ || = K. (28) 
L(v) is as defined in equation (4). For the forced system, v ¥ 0, 


£ 16) = —y'TF(v) — v'Gv + 0'b 
< —v'Gv + |o'd | 
< -v'Gv + K ||v || by Schwarz’s inequality (29) 
SS Naia ||| 0 | ae ||| 


where Amin 18 the smallest eigenvalue of G and is positive by assumption. 
Hence when ||v|| 2 K/min, (d/dt)L(v) < 0. Thus there exists a 7 such 
that fort > T, 


Io I< 


Vv. A PROPERTY OF TRANSISTOR NETWORKS 


Let R = {v' | (o’ — v)'TIFO’) — F(v)] > 0 Vu  0’}. Then: (2) if G 
is positive definite, v’ and b’ are constant vectors such that TF(v’) + 
Gv’ = b’, and v’ « R, then for all inputs which approach b’ as t > o, 
the state vector approaches v’ independent of initial conditions; (77) there 
exists a passive subnetwork for which the overall network is such that 
if the terminal state corresponding to an input approaching a constant 
vector is not in R, the closure of R, then there exists another input 
approaching the same constant vector for which the terminal state is 
different. 


Remark: It has been pointed out by Sandberg’ that the independence of 
the steady state from initial conditions is a basic property, and in Ref. 1 
it has been proved that if (7, G) belongs to a certain class then the region 
RF in statement 7 extends to the entire state space of the network. How- 
ever, the region R defined here is independent of G. 

In switching circuits with “memory” the dependence on initial 
conditions is a salient feature. Statement (7) observes that the design of 
such circuits be such that the steady states lie outside the region R, i.e., 
the steady state bias voltages violate the conditions which define the 
region R. Of course, the fact that the transistors are passive merely 
implies that R is non-empty, and their use in switching circuits with 
“memory” is not precluded. Statement (77) states that in a certain sense 
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the region R is maximal. 
Proof: (2) 
TF(v’) + Gu’ = bd! (30) 


and 
4 O) + TF) + Gv — vf =“ (C@) — Cw} 


+ T{FQ) — F’)} + Gv — 0") 


= b(t) — b’ (31) 
from (3) and (30). Define 
pb2v-v71’, (32a) 
C@) = Ce) — CO’), (32b) 
and 
F() = FW) — Fv’). (38) 
From (81), 
£ G@) + THO) + Go = WH) — BY. (34) 
Note that C;(-) for? = 1, 2, --+ , 2n are monotone, strictly increasing 


mappings of the real interval (—, ©) onto itself and C(0) = 0; 
also there exists C7*(-). Since v’ ¢ R, o‘'TF(o) > 0. Also by assumption 
for any « > 0 there exists 7 such that 


\|b@) — b' || S Amin € for t= T (35) 


where Amin is the smallest eigenvalue of G. On applying the results on 
bounded input-bounded output stability, as stated in Section IV, to the 
system described by equation (34) it follows that there exists a T’ such 
that fort > T’, 


Amin€ 


Ha | = Io = of I] <® 





= € (36) 


min 


and the proof is complete. : 
(77) It will be shown that if v,, ¢ R then there exists a G > 0 such that 


TF (Vo) + G5 = de (37a) 
TF(v1) + Gr, = de , (37b) 
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and 
Vn FEU. (37c) 


In (87a) v. denotes the terminal vector corresponding to an input 
approaching the constant vector b,,. The solution of the network equa- 
tion (3) with b = b,, and initial condition », is v(t) = 1. 

The proof is by simply observing from the definition of R and Lemma 1 
that there exist v; and G > 0 such that, 


T{F(v.) — F(v')} + Go, — v') = 0. (38) 


At least when the static diode characteristics are of the conventional 
exponential type the region F is easily obtained as shown below: 


(v' — v)'T{F(v’) — F(v)} = o'TD, F(d) (39) 
where 
d=v'—v 
and 
D, = diag (exp Aw{ , exp Ash, - ++ , EXP AonV5,). 


The necessary and sufficient conditions for the positivity of 6’TD,-F(d) 
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Fig. 4—Region R for the 7th transistor (exponential diode characteristics). 
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are, from (19), 





: ) eae @oc) OXD Ogee 1 
pee. aims EXP (hoi—W2i-1) Zs (40) 
ho: Gz; EXP (AziV2,) Qe 
forz = 1, 2, --- , n. (40) is equivalent to 
aig Meret 
Qi Oe 
(41) 
Qo; 4 2; 
In casi s (Nos 5-1 = Noi) ss n{ ? ; 
Qei-1 Q2;-102 
for? = 1, 2, --- , n. The region RF is plotted in Fig. 4. 
REFERENCES 
1. Sandberg, I. W., ‘Some Theorems on the Dynamic Response of Nonlinear Tran- 


2. 
3. 


sistor Networks,’’ B.S.T.J., 48, No. 1 (January 1969), p. 35. 
Shichman, H., “Integration System of a Non-linear Network Analysis Program,”’ 
IEEE Trans. Circuit Theory, August 1970, pp. 378-386. 
LaSalle, J. P., and Lefschetz, 8., Stability by Liapunov’s Direct Method, New York: 
Academic Press, 1961. 


. Koehler, D., ‘The Charge-Control Concept in the Form of Equivalent Circuits, 


Representing a Link Between the Classic Large Signal Doide and Transistor 
Models,”’ B.S.T.J., 46, No. 3 (March 1967), pp. 523-576. 


. Feynman, R. P., Leighton, R. B., and Sands, M., The Feynman Lectures on 


Physics, Quantum Mechanics, Vol. III, Reading, Mass.: Addison-Wesley, 1965. 


. Gummel, H. K., private communication. 


Copyright © 1971 American Telephone and Telegraph Company 
Tue BELL SYSTEM TECHNICAL JOURNAL 
Vol. 50, No. 8, October, 1971 
Printed in U.S.A, 


A Fast Amplitude Approximation 
For Quadrature Pairs 


By G. H. ROBERTSON 
(Manuscript received April 13, 1971) 


This paper gives an algorithm from which an approximation to the 
amplitude of a quadrature pair can be obtained stmply and without re- 
quiring more bits than are needed to represent the largest allowable am pli- 
tude, A. 


If A cos 6 and A sin 6 are a quadrature pair, the amplitude 1s 
A = (A’ cos’ 6 + A’ sin’ 6)'. 


If AX ts the larger magnitude of the pair and AY the smaller, an ap- 
proximation to A is 


Ar = AX 424A. 


Computations using this algorithm with random values of @ showed that 
when the true amplitude was A, the mean obtained was 1.087 A, with stand- 
ard deviation 0.031A. 

When A consists of a signal appreciably contaminated with noise so 
that several estimates are required before the signal can be detected reliably, 
no significant degradation («0.1 dB S/N) in detectability was found using 
the algorithm described above. 


I. INTRODUCTION 


When a discrete Fourier transformation is applied to samples from 
a data segment the results are usually desired in the form of phase 
and amplitude estimates of the components into which the data is 
resolved. When FFT" algorithms are used to perform the transforma- 
tion the information from which these estimates are obtained is 
usually produced in the form of quadrature pairs of numbers A cos 0, 
and A sin 6, where A is the amplitude and 6 the phase of the estimate. 
Obviously 


A = (A’ cos’ 6 + A’ sin” 6)? (1) 
2849 
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The intermediate step in this computation of squaring the com- 
ponents of the pair requires in general twice as many bits as are 
required to represent A. Computing the squares requires appreciable 
time as does the final square root computation. The situation is ag- 
gravated by the fact that there are as many pair members as the 
original data samples. 

This paper describes the use of an algorithm which gives an 
approximation that in many if not all cases is adequately accurate 
and greatly reduces the computation time as well as promoting effi- 
ciency in the use of hardware by not requiring more bits than are 
needed to represent the final estimates. 


II. DISCUSSION 


Let the larger of cos?6, sin?0, be represented by X? and the smaller 
by Y?. Then 


2\h 
A=AX (1 - z) 
i Game ie Ga 
= 4x (1455-335 ake (2) 
where 1 2 Y/X. 
An approximation to A is thus 
3 Y 
A’ = AX + 4AY (2) ; (3) 
which suggests the computationally more attractive form 
A’ = AX + HAY. (4) 


Note that AX and AY are magnitudes and (4) implies that the ampli- 
tude of a Fourier component is approximately the sum of the larger 
and half the smaller of its quadrature pair magnitudes. 


III. EVALUATION 


Curve (a) in Fig. 1 shows a plot of (4) as 6 ranges from zero to 
2/4 where the curve is reflected to 6 = 7/2. The pattern is then re- 
peated for the remaining three quadrants. The corresponding plot for 
(3) is shown in curve (b). 

From the samples displayed on this figure the mean and standard 
deviations were calculated and found to be: 
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For (3), mean = 1.01, standard deviation = 0.0157, and 
for (4), mean = 1.087, standard deviation = 0.081. 


The algorithm described by (3) thus gives more accurate estimates 
(as expected) than that described by (4) which, however, is easier to 
implement and requires less computation. 

When the Fourier analysis is carried out in an attempt to detect 
signal components contaminated by noise it is important to determine 
how processing techniques affect the detectability of such signals. In 
order to check this a program was prepared to compute performance 
estimates when the algorithm based on (4) was used. These estimates 
were plotted on standard ROC curves? for a linear detector and no 
significant degradation in detectability was observed. Extensive com- 
putations using the proposed algorithm have produced spectrograms 
of noise-contaminated signals that are undistinguishable from those 
obtained conventionally. 

Use of the algorithm described in equation (4) results in a phase- 
dependent variation of the detector output. However, when the phase 
is uniformly distributed in the range (0, 27) the amplitudes are quite 
densely clustered around the mean because of the shape of the modu- 
lation characteristic. Eighty-five percent of the amplitudes lie within 
a range of about 0.4 dB. The accuracy obtained using the fast ampli- 
tude algorithm based on (4) may thus be adequate even when the esti- 
mates are not corrupted by noise. 


IV. CONCLUSION 


An algorithm has been described for computing the approximate 
amplitudes of Fourier harmonics from their quadrature pairs. The 
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Fig. 1—Algorithm modulation characteristics. 
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algorithm is fast and does not require the use of more bits than are 
required to represent the largest permissible result. Use of the 
algorithm does not significantly degrade the detectability of signals 
contaminated by noise, compared with use of a perfect linear detector. 
Estimates obtained using the algorithm are amplitude modulated 
depending on their phase but the total modulation range is less than 
1 dB. Representing the magnitude of the larger component of a quad- 
rature pair by AX, and the smaller by AY, the algorithm is 


A’ = AX + 3AY. 
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